This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 



BLACK BORDERS 

TEXT CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT 
ILLEGIBLE TEXT 
SKEWED/SLANTED IMAGES 
COLORED PHOTOS 

BLACK OR VERY BLACK AND WHITE DARK PHOTOS 
GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



I 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 



(43) International Publication Date 
8 November 2001 (08.11.2001) PCT 



(10) International Publication Number 

WO 01/83729 A2 



(51) International Patent Classification 7 : C12N 15/00 

(21) International Application Number: PCT/EP0 1/04863 

(22) International Filing Date: 30 April 2001 (30.04.2001) 

(25) Filing Language: English 

(26) Publication Language: English 

(30) Priority Data: 

09/562,934 1 May 2000 (01 .05.2000) US 

(71) Applicants: NOVARTIS AG [CH/CH]; Schwarzwal- 
dallee 215, CH^058 Basel (CH). THE SCRIPPS 
RESEARCH INSTITUTE [US/US] f 1 0550 North Torrey 
Pines Road, La Jolla, CA 92037 (US).' 

(71) Applicants and 

= (72) Inventors: NEMEROW, Glen, R. [US/US]; 462 Cerro 

j== Street, Encinitas, CA 92024 (US). VON SEGGERN, 

= Daniel, J. [US/US]; Apartment 30, 5175 Luigi Terrace, 

55 San Diego, CA 92122 (US). FRIEDLANDER, Marty 

= [US/US]; 1720 Zapo Street, Del Mar, CA 92014 (US). 



(74) Agent: BECKER, Konrad; Novartis AG, Corporate In- 
tellectual Property, Patent & Trademark Dept., CH-4002 
Basel (CH). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, 
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, 
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, IT, TZ, UA, UG, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, Fl, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 

On 
<s 

m 

go 

i-H (54) Title: VECTORS FOR OCULAR TRANSDUCTION AND USE THEREOF FOR GENETIC THERAPY 

(57) Abstract: Adenovirus vector-based gene therapy methods for treating ocular disorders are provided Adenovirus vectors for 
^ therapy of ocular diseases and methods of treatment using the vectors are provided. Compositions, kits, and methods of preparation 
>^ and use of the vectors for gene therapy are provided. 



WO 01/83729 



PCT/EP01/04863 



VECTORS FOR OCULAR TRANSDUCTION AND USE THEREOF FOR GENETIC 

THERAPY 

Work described herein was supported by NIH grants EY11431 and 

HL54352. The government has certain rights in such subject matter. 
5 RELATED APPLICATIONS 

This application claims the benefit of priority to U.S. application Serial No. 

09/562,934, filed May 1, 2000, to Glen R. Nemerow, Daniel Von Seggern,; 

Martin Friedlander, entitled "VECTORS FOR OCULAR TRANSDUCTION AND USE 

THEREFOR FOR GENETIC THERAPY". 
10 This application is related to copending U.S. application Serial 

No. 09/482,682 (also filed as International PCT application No. 

PCT/USOO/00265, filed January 14, 2000)), to Daniel Von Seggern, Glen R. 

Nemerow, Paul Hallenbeck, Susan Stevenson, Yelena Skripchenko, filed January 

1.4, 2000, entitled "Adenovirus Vectors, Packaging Cell Lines,Compositions, and 
15 Methods for Preparation and Use," which is a continuation-in-part of U.S. 

Application 09/423,783 filed November 1 2, 1 999 and claims the benefit of the 

filing date of U.S. Provisional Application 60/1 15,920 filed January 14, 1999. 

Where permitted, the contents and subject matter of each application and of the 

provisional application are incorporated in their entirety herein by reference. 
20 FIELD OF INVENTION 

The present invention relates to gene therapy, especially to adenovirus 

vector-based gene therapy. In particular, adenovirus vectors for therapy of 

ocular diseases and methods of treatment using the vectors are provided. 

Compositions, kits, and methods of preparation and use of the vectors for gene 
25 therapy are provided. 

BACKGROUND OF THE INVENTION 

Retinal dystrophies 

The eye is susceptible to a number of hereditary and/or age related 

degenerative disorders. In the United States, common causes of irreversible 
30 blindness or severe loss of vision are retinal dystrophies [see, e.g., Cotlier et aL 

(1995) Surv. Ophthalmology 40:51-61; Bird (1995) Am. J. Ophthal. 119: 

543-562; and Adler (1996) Arch Ophthal 1 74:79-83). The retina is the sensory 
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tunic of the eye, containing light sensitive receptors, a complex of neurons, and 
pigmented epithelium, arranged in discrete layers. In humans, the macula is the 
portion of the retina that lies directly behind the lens. Cones, the photoreceptor 
cells responsible for central vision, are heavily concentrated in the macula. 
5 Central dystrophies, which affect the macula, include Best's disease, age-related 
macular degeneration, and Stargardt's macular dystrophy. The peripheral retina 
is composed mainly of rods, which are responsible for side and night vision. 
Peripheral degenerative retinal diseases include retinitis pigmentosa, choroidemia 
and Bietti's crystalline dystrophy. 

10 Macular degenerations are a heterogenous group of diseases, 

characterized by progressive central vision loss and degeneration of the macula 
and underlying retinal pigmented epithelium. Age-related macular degeneration <• 
(ARMD) is the most common form of the disease, affecting an estimated 20% oP 
persons over 75 years of age. ARMD is poorly understood in terms of etiology 

15 and pathogenesis. The very iate onset of the disease has made genetic mapping 
particularly difficult. Certain macular degenerative conditions with a clear genetic 
basis, such as Stargardt's and Best's diseases, share many features with ARMD, 
but have been more amenable to molecular and genetic analysis. 

Hereditary peripheral retinopathies are also relatively common. Retinitis 

20 pigmentosa (RP), for example, affects approximately 1 .5 million people 
worldwide. Substantial genetic heterogeneity has been observed in this 
condition, with over 20 chromosomal loci identified. A predisposition to retinitis 
pigmentosa can be inherited by autosomal dominant, autosomal recessive, 
X-linked or digenic mode. Mutations have been identified in seven genes, four of 

25 which encode proteins in the rod phototransduction cascade: rhodopsin, alpha 
and beta subunits of rod cGMP phosphodiesterase, and rod cGMP cation-gated 
channel protein .alpha, subunit. Mutations in the peripherin/RDS gene have been 
linked to retinitis pigmentosa and macular degeneration. A single peripherin/RDS 
mutation apparently caused retinitis pigmentosa, pattern dystrophy and fundus 

30 flavimaculatus, in different family members. 
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In spite of causal heterogeneity, there is significant clinical similarity 
among RP subtypes. Common signs and symptoms include early 
electroretinographic abnormalities, ophthalmoscopic findings, and 
protracted, contiguous expansion of the ring-like scotoma toward the macula, 
5 leading to progressively worsening tunnel vision. A recent hypothesis is that 
active photoreceptor cell death, which is characteristic of these genetically 
distinct disorders, is mediated by a common induction of apoptosis. It may be 
possible to treat these conditions by the administration of agents that block 
induction of apoptosis in photoreceptors, such as neurotrophic factors. 

10 Adenovirus delivery vectors 

Adenovirus, which is a DNA virus with a 36 kilobase (kb) genome, is very 
well-characterized and its genetics and genetic^orgariization are understood. 
The genetic organization of adenoviruses permits substitution of large fragments 
of viral DNA with foreign DNA. In addition, recombinant adenoviruses are 

15 structurally stable and no rearranged viruses are observed after extensive 
amplification. 

Adenoviruses have been employed as delivery vehicles for introducing 
desired genes into eukaryotic cells. The adenovirus delivers such genes to 
eukaryotic cells by binding to cellular receptors followed by internalization. The 

20 adenovirus fiber protein is responsible for binding to cells. The fiber protein has 
two domains, a rod-like shaft portion and a globular head portion that contains 
the receptor binding region. The fiber spike is a homotrimer, and there are 1 2 
spikes per virion. Human adenoviruses bind to and infect a broad range of 
cultured cell lines and primary tissues from different species. 

25 The 35,000+ base pair (bp) genome of adenovirus type 2 has been 

sequenced and the predicted amino acid sequences of the major coat proteins 
(hexon, fiber and penton base) have been described (see, e.g., Neumann et aL, 
Gene 69: 153-157 (1988); Herisse eta/., Nuc. Acids Res. 9: 4023-4041 (1981); 
Roberts et aL, J. Biol. Chem. 259: 13968-13975 (1984); Kinloch et aL, J. Biol. 

30 Chem. 259: 6431-6436 (1984); and Chroboczek et aL, Virol. 161: 549-554, 
1987). 
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The 35,935 bp sequence of Ad 5 DNA is also known and portions of 
many other adenovirus genomes have been sequenced. The upper packaging 
limit for adenovirus virions is about 105% of the wild-type genome length (see, 
e.g., Bett, et al., J. Virol. 67(10): 591 1-21,. 1993). Thus, for Ad2 and Ad5, this 
5 would be an upper packaging limit of about 38kb of DNA. 

Adenovirus DNA also includes inverted terminal repeat sequences (ITRs) 
ranging in size from about 100 to 150 bp, depending on the serotype. The 
inverted repeats permit single strands of viral DNA to circularize by base-pairing 
of their terminal sequences to form base-paired "panhandle" structures that are 

10 required for replication of the viral DNA. 

For efficient packaging, the ITRs and the packaging signal {a few hundred bp in 
f-3 ?rl0ngtb). comprise the "minimum requirement" for replication and packaging' of ^ - •» 

*pH ^genomic nucleic acid into an adenovirus particle. Helper-dependent vefetbrfe*;*^ 
lacking all viral ORFs but including these essential cis elements (the ITRs and 

15 contiguous packaging sequence) have been constructed. 

Ad vectors have several distinct advantages as gene delivery vehicles. 
For example, recombination of such vectors is rare; there are no known 
associations of human malignancies with adenoviral infections despite common 
human infection with adenoviruses; the genome may be manipulated to 

20 accommodate foreign genes of a fairly substantial size; and host proliferation is 
not required for expression of adenoviral proteins. Adenovirus (Ad)-based gene 
delivery vectors efficiently infect many different cells and tissues. This broad 
tropism, however, means that gene delivery cannot be directed to a specific 
target cell. A large fraction of intravenously administered adenovirus is 

25 retained by the liver, which could lead to undesirable side-effects. Adenovirus 
may potentiate immune responses. For example, Adenovirus type 5 (Ad5) also 
transduces dendritic cells, which present antigens very efficiently, thereby 
possibly exacerbating the immune response against the vector. It has been 
proposed that vectors with different targeting efficiencies might eliminate these 

30 problems, permitting a lower total particle dose and more sp cific targeting (see, 
e.g., U.S. application Serial No. 09/482,682). 
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The wealth of information on adenovirus structure and mechanism of 
infection, its efficient infection of nondividing cells, and its large genetic capacity 
make adenovirus a popular gene therapy vector. The wide expression of 
receptors to which adenovirus binds makes targeting adenovirus vectors 
5 difficult. 

Hence there is a need to improve delivery and targeting of adenoviral 
vectors and also to provide treatments for ocular disorders. Therefore, it is an 
object herein to provide adenoviral vectors that specifically or selectively target 
cells in the eye. It is also an object herein to provide these vectors for treatment 

10 of ocular disorders. 

SUMMARY OF THE INVENTION 

Degenerative ocular diseases,* such as, but not limited to, retinitis 
pigmentosa, Stargardt's diseased-diabetic retinopathies, retinal vascularization, 
and others (see, e.g., Table below), have a genetic basis. Genes expressed in 

15 the photoreceptor cells at the back of the retina are implicated in these diseases. 
Provided herein are recombinant viral vectors for targeting therapeutic products 
to these cells. 

Recombinant adenoviral vectors that include nucleic acid that permits 
specific binding to these photoreceptors are provided. In particular, the vector 

20 particles contain a fiber protein of Ad37 or a modified form thereof. As shown 
herein, fiber protein from Ad37 permits efficient infection of photoreceptor cells. 
Fiber proteins from other adenovirus D serotypes may also be used. In addition, 
the portions of the fiber protein, particularly those that interact with other viral 
structural proteins, such as penton, may be modified to resemble the viral source 

25 of the other structural proteins. As exemplified herein, the recombinant virus 
provided herein include Ad 5 structural components. The N-terminus of the Ad37 
fiber protein, which interacts with the penton protein, is modified to resemble 
the Ad 5 fiber protein N-terminus to ensure production of viral particles. 

The recombinant adenoviral vectors are intended for gene therapy of 

30 diseases in which genes expressed in the photoreceptors are implicated. Such 
diseases include, but are not limited to, degenerative ocular diseases, such as 
retinitis pigmentosa and Stargardt's disease. These vectors are also useful for 
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targeting to other ocular cells, such as conjunctival cells, which also bear 
receptors to which fiber from Ad37 and related serotypes bind. 

The vectors will deliver therapeutic agents to the targeted cells for 
treatment of a variety of disorders (see e.g., Tables 3 and 4, below)). The 
5 therapeutic agents are intended for expression in the photoreceptors and for 
secretion from the photorecptor cells, which are surrounded on one side by 
choroidal vasculature, and on the other side by retinal vasulature, thereby 
providing a means for delivery of products. In addition, expression of growth 
factors, such as VEGF and others, can be used to enhance blood flow to the 
10 retina and prevent or slow the degeneration. 

Therapeutic agents encoded by the recombinant adenoviral vectors' ^ 
' ft-ji&h' include, but are not limited to, nucleic acid nucleic acid molecules/encoding* 
'k0^t^ : genes that are defective in certain hereditary disorders, nucleic acid molecules * 
that encode antiangiogenics and antitumor agents for treatment of retinal 
1 5 disorders, such as retinoblastomas; nucleic acid molecules encoding trophic 
factors, such as glial cell line-derived neuroptrophic factor (GDNF) and ciliary 
neurotrophic factor (CNTF), growth factors and growth factor inhibitors, 
antiapoptotic factors, such as Bcl-2 (CNTF), antitumor agents, anti-angiogenics, 
and genes or portions thereof for gene replacement or repair of defective genes. 
20 Hence, methods for treatment of inherited and acquired retinal diseases, 

including diseases involving neovascular and vascular degeneration are provided. 

Methods for treating diseases involving genes expressed in photoreceptor 
cells are provided herein. The methods provided herein are practiced by 

25 administration of the recombinant viral vectors by any means suitable for 

delivery to the photoreceptors. A preferred mode of administration is intraocular 
injection including intravitreal and subretinal injection. Other modes of 
administration include, but are not limited to, intrascleral, periorbital and 
intravenous administration. The vectors also can include photoreceptor-specific 

30 promoters thereby providing a means, not only for specific targeting of 
expression in these cells, but also for photoreceptor-restricted transgene 
expression. 
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DETAILED DESCRIPTION OF THE INVENTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
5 which this invention belongs. All patents, applications, published applications 
and other publications and sequences from GenBank and other data bases 
referred to anywhere in the disclosure herein are incorporated by reference in 
their entirety. 

As used herein, the amino acids, which occur in the various amino acid 
10 sequences appearing herein, are identified according to their three-letter or one- 
letter abbreviations. The nucleotides, which occur in the various DNA 
fragments, are designated^with, the standard single-letter designations used 
routinely in the art^seev^Table 1). 

As used herein, amino acid residue refers to an amino acid formed upon 
15 chemical digestion (hydrolysis) of a polypeptide at its peptide linkages. The 
1 amino acid residues described herein are preferably in the "L" isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino 
acid residue, as long as the desired functional property is retained by the 
polypeptide. NH 2 refers to the free amino group present at the amino terminus 
20 of a polypeptide. COOH refers to the free carboxy group present at the carboxyl 
terminus of a polypeptide. In keeping with standard polypeptide nomenclature 
described in J. Biol. Chem., 245:3552-59 (1969) and adopted at 37 C.F.R. § § 
1 .821 - 1.822, abbreviations for amino acid residues are shown in the following 
Table: 

25 Table 1 



Table of Correspondence 



SYMBOL 




1 -Letter 


3-Letter 


AMINO ACID 


Y 


Tyr 


tyrosine 


G 


Gly 


glycine 


F 


Phe 


phenylalanine 


M 


Met 


methionine 
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SYMBOL 




A 

A 


Ala 


alanine 


o 


oer 


corino 

serine 


1 


Me 


isoieucine 


1 

L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glutamine^ 


t 


OlU 


glutamic acid 


z 


GIx 


uiu cJiiu/Ui uin 


w 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 


N 


Asn 


asparagine 


B 


Asx 


Asn and/or Asp 


C 


Cys 


cysteine 


X 


Xaa 


Unknown or other 



It should be noted that all amino acid residue sequences represented, 
herein by formulae have a left to right orientation in the conventional direction of 
amino-terminus to carboxyl-terminus. In addition, the phrase "amino acid 
residue" is broadly defined to include the amino acids listed in the Table of 
25 Correspondence and modified and unusual amino acids, such as those referred 
to in 37 C.F.R. § § 1,821-1.822, and incorporated herein by reference. 
Furthermor , it should be noted that a dash at the beginning or end of an amino 
acid residue sequence indicates a peptide bond to a further sequence of one or 
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more amino acid residues or to an amino-terminal group such as NH 2 or to a 
carboxyl-terminal group such as COOH. 

In a peptide or protein, suitable conservative substitutions of amino acids 
are known to those of skill in this art and may be made generally without altering 
5 the biological activity of the resulting molecule. Those of skill in this art 
recognize that, in general, single amino acid substitutions in non-essential 
regions of a polypeptide do not substantially alter biological activity (see, e.g.. 
Watson et ah Molecular Biology of the Gene, 4th Edition, 1 987, The 
Bejacmin/Cummings Pub. co., p. 224). 
10 Such substitutions are preferably made in accordance with those set forth 

in TABLE 2 as follows:? 

* t-K&^^w^' TABLE 2 

ii Original residue ' Conservative substitution 

----- 'Ala (A) Gly;Ser 

15 Arg (R) Lys 

Asn (N) Gin; His 

Cys (C) Ser 

Gin (Q) Asn 

Glu (E) Asp 

20 Gly (G) Ala; Pro 

His (H) Asn; Gin 

lie (I) Leu; Val 

Leu (L) lie; Val 

Lys (K) Arg; Gin; Glu 

25 Met (M) Leu; Tyr; lie 

Phe (F) Met; Leu; Tyr 

Ser (S) Thr 

Thr (T) Ser 

Trp (W) Tyr 

30 Tyr (Y) Trp; Phe 

Val (V) lie; Leu 

Other substitutions are also permissible and may be determined empirically or in 
accord with known conservative substitutions. 
35 As used herein, a complementing plasmid describes plasmid vectors that 

deliver nucleic acids into a packaging cell line for stable integration into a 
chromosome in the cellular genome. 

As used herein, a delivery plasmid is a plasmid vector that carries or 
delivers nucleic acids encoding a therapeutic gene or gene that encodes a 
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therapeutic product or a precursor thereof or a regulatory gene or other factor 
that results in a therapeutic effect when delivered in vivo in or into a cell line, 
such as, but not limited to a packaging cell line, to propagate therapeutic viral 
vectors. 

5 As used herein, a variety of vectors with different requirements are 

described. For example, one vector is used to deliver particular nucleic acid 
molecules into a packaging cell line for stable integration into a chromosome. 
These types of vectors are generally identified herein as complementing 
plasmids. A further type of vector described herein carries or delivers nucleic 

10 acid molecules in or into a cell line [e.g., a packaging cell line) for the purpose of 
propagating therapeutic viral vectors; hence, these vectors are generally referred 
, to herein as delivery plasmids. A third "typ9%of vector described herein is used 
to carry nucleic acid molecules encodingVtherapputic proteins or polypeptides or 
regulatory proteins or are regulatory sequences to specific cells or cell types in a 

1 5 subject in need of treatment; these vectors are generally identified herein as 
therapeutic viral vectors or recombinant adenoviral vectors or viral Ad-derived . 
vectors and are in the form of a virus particle encapsulating a viral nucleic acid 
containing an expression cassette for expressing the therapeutic gene. 

As used herein, a DNA or nucleic acid homolog refers to a nucleic acid 

20 that includes a preselected conserved nucleotide sequence, such as a sequence 
encoding a therapeutic polypeptide. By the term "substantially homologous" is 
meant having at least 80%, preferably at least 90%, most preferably at least 
95% homology therewith or a lesser percentage of homology or identity and 
conserved biological activity or function. 

25 The terms "homology" and "identity" are often used interchangeably. In 

this regard, percent homology or identity may be determined, for example, by 
comparing sequence information using a GAP computer program. The GAP 
program utilizes the alignment method of Needleman and Wunsch (J. Mol. Biol 
48:443 (1970), as revised by Smith and Waterman (Adv. AppL Math. 2:482 

30 (1981). Briefly, the GAP program defines similarity as the number of aligned 
symbols (i. nucleotides or amino acids) which are similar, divided by the total 
number of symbols in the shorter of the two sequences. The preferred default 
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i 

parameters for the GAP program may include: (1) a unary comparison matrix 
(containing a value of 1 for identities and 0 for non-identities) and the weighted 
comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 14:6745 (1986), 
as described by Schwartz and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE 
5 AND STRUCTURE, National Biomedical Research Foundation, pp. 353-358 

(1979); (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each 
symbol in each gap; and (3) no penalty for 
end gaps. 

Whether any two nucleic acid molecules have nucleotide sequences that 

10 are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% "identical" can be 
determined 2 using known computer algorithms such as the "FAST A" program, * 
musing* for example, the default parameters as in Pearson and Lipman, ProcJ Native 
WWci&ScL USA 55:2444 (1988). Alternatively the BLAST function of th^'^'^^r 
National Center for Biotechnology Information database may be used to 

1 5 determine identity. 

In general, sequences are aligned so that the highest order match is 
obtained. "Identity" per se has an art-recognized meaning and can be calculated 
using published techniques. (See, e.g.: Computational Molecular Biology, Lesk, 
A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics 

20 and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data r Part I, Griffin, A.M., and Griffin, H.G., 
eds., Humana Press, New Jersey, 1 994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, 
Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991). 

25 While there exist a number of methods to measure identity between two 

polynucleotides or polypeptide sequences, the term "identity" is well known to 
skilled artisans {Carillo, H. & Upton, D., S/AM J Applied Math 45:1073 (1988)). 
Methods commonly employed to determine identity or similarity between two 
sequences include, but are not limited to, those disclosed in Guide to Huge 

30 Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and 
Carillo, H. & Lipton, D., SIAM J Applied Math 45:1073 (1988). Methods to 
determine identity and similarity ar codified in computer programs. Pref rred 
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computer program methods to determine identity and similarity between two 
sequences include, but are not limited to, GCG program package (Devereux, J., 
et al, Nucleic Acids Research 12(l):387 (1 984)), BLASTP, BLASTN, FASTA 
(Atschul, S.F., etal.,JMoiec Biol 275:403 (1990)). 
5 Therefore, as used herein, the term "identity" represents a comparison 

between a test and a reference polypeptide or polynucleotide. For example, a 
test polypeptide may be defined as any polypeptide that is 90% or more 
identical to a reference polypeptide. As used herein, the term at least "90% 
identical to" refers to percent identities from 90 to 99.99 relative to the 

10 reference polypeptides. Identity at a level of 90% or more, is indicative of the 
fact that, assuming for exemplification purposes a test and reference 
polynucleotide length of 1 00' aminoacids are compared. No<morethan 10% 
(i.e., 10 out of 100) amino acidsUn^the test polypeptide differs from that of the 
reference polypeptides. Similar comparisons may be made between a test and 

15 reference polynucleotides. Such differences may be represented as point 

mutations randomly distributed over the entire length of an amino acid sequence 
or they may be clustered in one or more locations of varying length up to the 
maximum allowable, e.g. 10/100 amino acid difference (approximately 90% 
identity). Differences are defined as nucleic acid or amino acid substitutions, or 

20 deletions. 

As used herein, genetic therapy involves the transfer of heterologous 
DNA to the certain cells, target cells, of a mammal, particularly a human, with a 
disorder or conditions for which such therapy is sought. The DNA is introduced 
into the selected target cells in a manner such that the heterologous DNA is 

25 expressed and a therapeutic product encoded thereby is produced. 

Alternatively, the heterologous DNA may in some manner mediate expression of 
DNA that encodes the therapeutic product, it may encode a product, such as a 
peptide or RNA that in some manner mediates, directly or indirectly, expression 
of a therapeutic product. Genetic therapy may also be used to deliver nucleic 

30 acid encoding a gene product to replace a defective gene or supplement a gene 
product produced by the mammal or the cell in which it is intr duced. The 
introduced nucleic acid may encode a therapeutic compound, such as a growth 
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factor inhibitor thereof, or a tumor necrosis factor or inhibitor thereof, such as a 
receptor therefor, that is not normally produced in the mammalian host or that is 
not produced in therapeutically effective amounts or at a therapeutically useful 
time. The heterologous DNA encoding the therapeutic product may be modified 
5 prior to introduction into the cells of the afflicted host in order to enhance or 
otherwise alter the product or expression thereof. 

As used herein, heterologous DNA is DNA that encodes RNA and proteins 
that are not normally produced in vivo by the cell in which it is expressed or that 
mediates or encodes mediators that alter expression of endogenous DNA by 
10 affecting transcription, translation, or other regulatable biochemical processes. 
\ ^Heterologous DNA may also be referred to as foreign DNA. Any DNA that Srie 
« T-f; of skill in the art would recognize or consider as heterologous or foreign to tfie 
^ cell in which it is expressed is herein encompassed by heterbio§ous%NA. 

Examples of heterologous DNA include, but are not limited to, DNA that encodes 
15 traceable marker proteins, such as a protein that confers drug resistance, DNA 
that encodes therapeutically effective substances, such as anti-cancer agents, 
enzymes and hormones, and DNA that encodes other types of proteins, such as 
antibodies. Antibodies that are encoded by heterologous DNA may be secreted 
or expressed on the surface of the cell in which the heterologous DNA has been 
20 introduced. 

Hence, herein heterologous DNA or foreign DNA, refers to a DNA 
molecule not present in the exact orientation and position as the counterpart 
DNA molecule found in the corresponding wild-type adenovirus. It may also 
refer to a DNA molecule from another organism or species (i.e., exogenous) or 
25 from another Ad serotype. 

As used herein, a therapeutically effective product is a product that is 
encoded by heterologous DNA that, upon introduction of the DNA into a host, a 
product is expressed that effectively ameliorates or eliminates the symptoms, 
manifestations of an inherited or acquired disease or that cures said disease. 
30 Typically, DNA ncoding the desired heterologous DNA is cloned into a 

plasmid vector and introduced by routine methods, such as calcium-phosphate . 
mediated DNA uptak (se , (19811 Somat. Cell. Mol. Genet. 7:603-616) or 
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microinjection, into producer cells, such as packaging cells. After amplification 
in producer cells, the vectors that contain the heterologous DNA are introduced 
into selected target cells. 

As used herein, an expression or delivery vector refers to any plasmid or 
5 virus into which a foreign or heterologous DNA may be inserted for expression in 
a suitable host cell — i.e., the protein or polypeptide encoded by the DNA is 
synthesized in the host cell's system. Vectors capable of directing the 
expression of DNA segments (genes) encoding one or more proteins are referred 
to herein as "expression vectors." Also included are vectors that allow cloning 

10 of cDNA (complementary DNA) from mRNAs produced using reverse 
transcriptase. < ^ 

As used herein,.^ gene is a nucleic acid molecule whose nucleotide 
sequence encodes^RNA^ gene can be either RNA or DNA. 

Genes may include regions preceding and following the coding region (leader and 

1 5 trailer) as well as intervening sequences (introns) between individual coding 
segments (exons). 

As used herein, tropism with reference to an adenovirus refers to the 
selective infectivity or binding that is conferred on the particle by the fiber 
protein, such as by the C-terminus portion that comprises the knob. 

20 As used herein, isolated with reference to a nucleic acid molecule or 

polypeptide or other biomolecule means that the nucleic acid or polypeptide has 
separated from the genetic environment from which the polypeptide or nucleic 
acid were obtained. It may also mean altered from the natural state. For 
example, a polynucleotide or a polypeptide naturally present in a living animal is 

25 not "isolated/ but the same polynucleotide or polypeptide separated from the 
coexisting materials of its natural state is "isolated", as the term is employed 
herein. Thus, a polypeptide or polynucleotide produced and/or contained within 
a recombinant host cell is considered isolated. Also intended as an "isolated 
polypeptide" or an "isolated polynucleotide" are polypeptides or polynucleotides 

30 that have been purified, partially or substantially, from a recombinant host cell or 
from a native source. For example, a recombinant^ produced version of a 
compound can be substantially purified by the one-step method described in 
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Smith and Johnson, Gene 57/31-40 (1988). The terms isolated and purified are 
sometimes used interchangeably. 

Thus, by "isolated" is meant that the nucleic acid is free of the coding 
sequences of those genes that, in the naturally-occurring genome of the 
5 organism (if any) immediately flank the gene encoding the nucleic acid of 

interest. Isolated DNA may be single-stranded or double-stranded, and may be 
genomic DNA, cDNA, recombinant hybrid DNA, or synthetic DNA. It may be 
identical to a native DNA sequence, or may differ from such sequence by the 
deletion, addition, or substitution of one or more nucleotides. 

10 Isolated or purified as it refers to preparations made from biological cells 

or hosts means any cell extract containing the indicated DNA or protein including 
a crude extract of the DNA or protein of interest. For exampleHn^the case of a 
protein, a purified preparation can be obtained follbwihg%n individual technique 
or a series of preparative or biochemical techniques and the DNA or protein of 

15 interest can be present at various degrees of purity in these preparations. The 
procedures may include for example, but are not limited to, ammonium sulfate 
fractionation, gel filtration, ion exchange chromatography, affinity 
chromatography, density gradient centrifugation and electrophoresis. 

A preparation of DNA or protein that is "substantially pure" or "isolated" 

20 should be understood to mean a preparation free from naturally occurring 
materials with which such DNA or protein is normally associated in nature. 
"Essentially pure" should be understood to mean a "highly" purified preparation 
that contains at least 95% of the DNA or protein of interest. 

A cell extract that contains the DNA or protein of interest should be 

25 understood to mean a homogenate preparation or cell-free preparation obtained 
from cells that express the protein or contain the DNA of interest. The term 
"cell extract" is intended to include culture media, especially spent culture media 
from which the cells have been removed. 

As used herein, a packaging cell line is a cell line that provides a missing 

30 gene product or its equivalent. 

As used her in, an adenovirus viral particle is the minimal structural or 
functional unit of a virus. A virus can refer to a single particle, a stock of 
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particles or a viral genome. The adenovirus (Ad) particle is relatively complex 
and may be resolved into various substructures. 

As used herein, "penton" or "penton complex" are preferentially used 
herein to designate a complex of penton base and fiber. The term "penton" may 
5 also be used to indicate penton base, as well as penton complex. The meaning 
of the term "penton" alone should be clear from the context within which it is 
used. 

As used herein, a plasmid refers to an autonomous self-replicating 
extrachromosomal circular nucleic acid molecule, typically DNA. 

10 As used herein, a post-transcription regulatory element (PRE) is a 

regulatory element found in viral or cellular messenger RNA that is not spliced, ; * 

i.e. introriless* messages. Examples include, but are not limited to, human * 
hepatiti^Virus; woddchuck hepatitis virus, the TK gene and mouse histone gene* 
The PRE may be placed before a polyA sequence and after a heterologous DNA 

15 sequence. 

As used herein, pseudotyping describes the production of adenoviral 
vectors having modified capsid protein or capsid proteins from a different 
serotype than the serotype of the vector itself. One example, is the production 
of an adenovirus 5 vector particle containing an Ad37 fiber protein. This may be 

20 accomplished by producing the adenoviral vector in packaging cell lines 
expressing different fiber proteins. 

As used herein, promoters of interest herein may be inducible or 
constitutive. Inducible promoters will initiate transcription only in the presence 
of an additional molecule; constitutive promoters do not require the presence of 

25 any additional molecule to regulate gene expression, a regulatable or inducible 
promoter may also be described as a promoter where the rate or extent of RNA 
polymerase binding and initiation is modulated by external stimuli. Such stimuli 
include, but are not limited to various compounds or compositions, light, heat, 
stress and chemical energy sources. Inducible, suppressible and repressible 

30 promot rs are considered regulatable promoters. Preferred promoters herein, are 
promot rs that are selectively expressed in ocular cells, particularly 
photoreceptor cells. 
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As used herein, receptor refers to a biologically active molecule that 
specifically or selectively binds to (or with) other molecules. The term "receptor 
protein" may be used to more specifically indicate the proteinaceous nature of a 
specific receptor. 

5 As used herein, recombinant refers to any progeny formed as the result of 

genetic engineering. This may also be used to describe a virus formed by 
recombination of plasmids in a packaging cell. 

As used herein, a transgene or therapeutic nucleic acid molecule includes 
DNA and RNA molecules encoding an RNA or polypeptide. Such molecules may 

10 be "native" or naturally-derived sequences; they may also be "non-native" or 
"foreign" that are naturally- or recombinantly-derived. . Jhe term "transgene," 
ft . which may be used interchangeably herein witjh>^he^term "therapeutic nucleic 
acid molecule," is often used to describe^, heterologous or foreign (exogenous) 
gene that is carried by a viral vector and transduced into a host cell. 

15 Therefore, therapeutic nucleotide nucleic acid molecules include antisense 

sequences or nucleotide sequences which may be transcribed into antisense 
sequences. Therapeutic nucleotide sequences (or transgenes) all include nucleic 
acid molecules that function to produce a desired effect in the cell or cell nucleus 
into which said therapeutic sequences are delivered. For example, a therapeutic 

20 nucleic acid molecule can include a sequence of nucleotides that encodes a 
functional protein intended for delivery into a cell which is unable to produce 
that functional protein. 

As used herein, the vitreous of the eye refers to material that fills the 
chamber behind the lens of the eye (i.e., vitreous humor or vitreous body), 

25 As used herein, a promoter region refers to the portion of DNA of a gene 

that controls transcription of the DNA to which it is operatively linked. The 
promoter region includes specific sequences of DNA that are sufficient for RNA 
polymerase recognition, binding and transcription initiation. This portion of the 
promoter region is referred to as the promoter. In addition, the promoter region 

30 includes sequences that modulate this recognition, binding and transcription 

initiation activity of the RNA polymerase. These sequences may be cfs acting or 
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may be responsive to trans acting factors. Promoters, depending upon the 
nature of the regulation, may be constitutive or regulated. 

Thus, promoters are nucleic acid fragments that contain a DNA sequence 
that controls the expression of a gene located 3' or downstream of the 
5 promoter. The promoter is the DNA sequence to which RNA polymerase 
specifically binds and initiates RNA synthesis (transcription) of that gene, 
typically located 3' of the promoter. A promoter also includes DNA sequences 
that direct the initiation of transcription, including those to which RNA 
polymerase specifically binds. If more than one nucleic acid sequence encoding < 

10 particular polypeptide or protein is included in a therapeutic viral vector or 
nuc|eotide v s*equence, more than one promoter or enhancer element may be 
■•• i„g includfed,.,particularly if that, would enhance efficiency of expression. ^ 

A regulatable or inducible promoter may be described as £ promoter?? - 
wherein the rate of RNA polymerase binding and initiation is modulated by 

15 external stimuli, (see, e.g., U.S. Patent Nos. 5,750,396 and 5,998,205). Such 
stimuli include various compounds or compositions, light, heat, stress, chemical 
energy sources, and the like. Inducible, suppressible and repressible promoters 
are considered regulatable promoters. 

Regulatable promoters may also include tissue-specific promoters. 

20 Tissue-specific promoters direct the expression of the gene to which they are 
operably linked to a specific cell type. Tissue-specific promoters cause the gene 
located 3' of it to be expressed predominantly, if not exclusively, in the specific 
cells where the promoter expressed its endogenous gene. Typically, it appears 
that if a tissue-specific promoter expresses the gene located 3' of it at all, then 

25 it is expressed appropriately in the correct cell types (see, e.g., Palmiter et al. 
(1986) Ann. Rev. Genet. 20: 465-499). 

As used herein, the phrase "operatively linked" generally means the 
sequences or segments have been covalently joined into one piece of DNA, 
whether in single or double stranded form, whereby control sequences on one 

30 segment control expression or replication or other such control of other 
segments. The two segments are not necessarily contiguous. 
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As used herein, exogenous encompasses any therapeutic composition 
that is administered by the therapeutic methods provided herein. Thus, 
exogenous may also be referred to herein as foreign, or non-native or other 
equivalent expression. 
5 B. Ad37 fiber tropism 

The adenovirus fiber protein is a major determinant of adenovirus tropism 
(Ga\\ etal. (1996) J. Virol. 70:21 16-2123; Stevenson eta/. (1995) J. Virol. 
69:2850-2857). The fiber protein extends from the capsid and mediates viral 
binding to the cell surface by binding to specific cell receptors (Philipson et af. 

10 (1968) J. ViroL 2:1064-1075), The fiber is a trimeric protein that includes an N- 
terminal tail domain that interacts with the adenovirus penton base, a central 
shaft domain of varying length/ and- a f C-tefminal knob domain that contains the 
cell receptor binding site (Chrobbczek et al. (1995) \ Curr.Top.Microbiol.lmmunol. 
755:163-200; Riurok et al. (1990) J.Mol. Biol. 2/5:589-596; Stevenson et al. 

15 (1995) J. ViroL 65:2850-2857). Fiber proteins of most adenovirus subgroups 
have been shown to bind specifically or selectively to the 46 kDa coxsackievirus- 
adenovirus receptor (CAR), (Bergelson et al. (1997) Science 275:1320-1323; 
Roelvink etal. (1998) J. ViroL 72:7909-7915). CAR appears to be expressed in 
a variety of human tissues, including the lung, at various levels (Bergelson et al. 

20 (1997) Science 275:1320-1323), but Ad37 binds poorly to lung epithelial cells 
(Huang etal. (1999) J. ViroL 75:2798-2802). This suggests that the tropism of 
this serotype may be influenced by factors independent of CAR expression. 

Structural and biochemical data also suggest that distinct receptor binding 
sites are located on different regions of the Ad5 and Ad37 fiber knobs. 

25 Adopting the nomenclature of Xia et al. (Xia et al. (1 994) Structure 2:1 259- 

1 270), the receptor binding site for Ad 5 is located at the AB-loop on the side of 
the fiber knob (Bewley etal. (1999) Science 266:1579-1583; Roelvink et aL 
(1999) Science 266:1568-1571). It is known that a lysine residue at position 
240 of the Ad37 fiber, located in the CD-loop, is important for receptor binding 

30 (Huang etal. (1999) J. ViroL 75:2798-2802). The co-crystal structur of th 
Ad12 knob and the N-terminal domain of CAR (Bewley etal. (1999) Science 
266:1579-1583) show that the CD-loop does not contact CAR. It thus app ars 
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that different regions of the Ad5 and Ad37 fiber knobs recognize distinct cell 
receptors. 

A 46 kDa receptor for coxsackieviruses and adenoviruses (CAR) mediates 
attachment for many adenovirus serotypes. The wide distribution of CAR fails 
5 to explain why certain adenovirus serotypes (i.e. Ad37) are highly associated 
with severe ocular infections such as epidemic keratoconjunctivitis (EKC). Ad37 
does not use CAR, but instead uses a glycoprotein that contains sialic acid as its 
primary receptor (Arnberg et al. ((2000) J. Virol. 74:42-48). The modest 
number of Ad37 binding sites per cell (Huang et al. (1999) J. Virol. 73:2798- 

10 2802) also suggests that Ad37 recognizes a specific glycoprotein as its primary 
^receptor for binding to conjunctival cells. , « . * 

i I; * v> Adenovirus type 37 (subgroup D) has been associated; wijh. infections of 

:*k> the eye and genital tract. The tropism of Ad37 derives from Ahe binding 

preference of its fiber protein, which binds to a receptor located on the surface 

15 of cells including Chang C, conjunctival epithelial cell line (Huang etal. (1999) J. 
Virology 73:2798-2802). 

A protein receptor that is preferentially expressed on conjunctival cells to 
which Ad37 fiber. binds is shown herein. The preferential expression of the 
Ad37 receptor protein on conjunctival cells suggests that this receptor likely 

20 influences Ad37 tropism and should play a key role in ocular pathogenesis. It is 
shown herein that Ad37 uses a distinct protein receptor that is selectively 
expressed on conjunctival cells. It is shown that Ad37 binds well to conjunctival 
cells (Chang C), but poorly to lung carcinoma cells (A549). To determine if 
infection correlated with cell binding, an Ad5 vector containing the Ad37 fiber 

25 protein was constructed. The 'pseudotyped' vector delivered transgenes to 
Chang C cells better than to A549 cells. Ad37 binding was abolished by 
protease treatment of Chang C cells, indicating the receptor is a membrane 
protein. Ad37 binding to conjunctival cells is shown herein to be calcium- 
dependent. It is also shown that Ad37 infection was not inhibited by a function- 

30 blocking anti-CAR monoclonal antibody, which is a feature distinct from Ad5 
fiber interaction with CAR. Using a virus overlay protein blot assay (VOPBA), 
calciunvd pendent Ad37 binding to a 50 KDa m mbrane prot in on Chang C 
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cells, but not A549 cells was detected. Ad19p a closely related serotype that 
fails to bind to conjunctival cells, does not recognize the 50 kDa protein* 
Together, these data indicate that the 50 kDa protein is a candidate receptor for 
Ad37 on conjunctival cells. 
5 Significantly, it is also shown herein that, upon administration of the 

vector to the vitreous humor, the recombinant adenovirus with the Ad37 fiber 
preferentially and selectively binds to photoreceptor cells. Hence, a recombinant 
adenoviral delivery vehicle that has an Ad37 fiber protein can serve as a vector 
for delivery of therapeutic agents to the eye for treatment of ocular disorders, 

10 including genetic and acquired disorders. The identification of the receptor for 
Ad37 and the resulting recognition of Ad37 tropism allows targeting of 
adenovirus vectors.tQsspecific^human ocular cells. 

As noted,lifibef t plays a crucial role in adenovirus infection by attaching 
the virus to a specific receptor on a cell surface. Hexon, penton and fiber 

15 capsomeres are the major components on the surface of the virion. The fiber is 
an elongated protein which exists as a trimer of three identical polypeptides 
(polypeptide IV) of 582 amino acids in length. An adenovirus fiber includes 
three domains: an N-terminal tail domain that interacts with penton base; a 
shaft composed of variable numbers of repeats of a 1 5-amino-acid segment that 

20 forms beta-sheet and beta-bends; and a knob at the C-terminus (''head domain 17 ) 
that contains the type-specific antigen and is responsible for binding to the cell 
surface receptor. The gene encoding the fiber protein from Ad 2 has been 
expressed in human cells and has been shown to be correctly assembled into 
trimers, glycosylated and transported to the nucleus (see, e.g., Hong and Engler, 

25 Virology 185: 758-761, 1991). Thus, alteration of the fiber in recombinant Ad 
vectors can lead to alteration in gene delivery. 

As shown herein, alteration of fiber in recombinant Ad vectors such that 
the fiber is derived from Ad37 or another adenovirus serotype D, provides a 
means for selective delivery of a recombinant virus to particular cells in the eye, 

30 including conjunctival cells, and most significantly photoreceptors, thereby 
providing a means for targeted delivery to photoreceptor cells. 
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Photoreceptor cells are implicated in a number of hereditary and acquired 
retinal degenerative disorders. In addition, photoreceptor cells are located such 
that products produced therein can be delivered to other areas of the eye by 
virtue of the blood flow in the vicinity of the photoreceptor cells and also by 

5 virtue of the proximity of the photoreceptors to the retinal pigmented epithelium 
(RPE) and other retinal cells. 

Hence it is contemplated herein that the recombinant viral vector will 
include a packaged recombinant adenovirus genome containing at least the 
minimal elements for replication and packaging; heterologous DNA encoding a 

0 desired gene product typically a therapeutic product or plurality of products, 
such as several trophic factors, whose combined activity is effective for treating 
a disorder, such as a retinal degenerative disorder; andvtbei resulting virion * 
particles will include a fiber that has a sufficient ptfrtion^to^ctinfer specific, 
targeting to photoreceptor cells when the recombinant viral particles are 

5 introduced into the aqueous humor of a mammalian, preferably a human, eye, or 
otherwise contacted with the photoreceptor cells. The fiber may be a chimeric 
protein that has been modified for effective interaction with other coat structural 
proteins, such as penton. In addition, the fiber may be modified to. include other 
elements that alter its tropism to permit binding to other cells as well {see, e.g., 

» U.S. Patent Nos. 5,756,086 and 5,543,328, International PCT application No. 
WO 95/26412 and WO 98/44121 and Krasnykh, eta/. [J. ViroL 70: 6839-46, 
1996). 

C. Construction of the viral particles 

1 . Selection of viral genome and fiber protein 

5 Methods for preparing recombinant adenoviral vectors for gene 

product delivery are well known. Preferred among those are the methods 
exemplified herein (see EXAMPLES) and also described in copending U.S. 
application Serial No. 09/482,682 (also filed as International PCT application 
No. PCT/USOO/00265, filed January 14, 2000, which claims priority to U.S. 

) provisional application Serial No. 60/1 15,920, as does U.S. application Serial 
No. 09/482,682)). 
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As noted, any desired recombinant adenovirus is contemplated for use in 
the methods herein as long as the viral genome is packaged in a capsid that 
includes at least the portion of a fiber protein that provides selective binding to 
photoreceptor cells. This fiber protein is preferably from an adenovirus type D 
5 serotype and is preferably an Ad37 fiber. The fiber protein should retain the 
knob region at the C-terminus ("head domain") from the Ad virus of subgroup D 
that contains the type-specific antigen and is responsible for binding to the cell 
surface receptor. Hence the fiber protein can be a chimeric fiber protein as long 
as it retains a sufficient portion of the type D serotype to specifically or 

10 selectively bind to photoreceptor cells. Generally the portion retained will be all 

or a portion of the knob region. The precise amount of knob region required can » ^W* 

be determined ^empirically by including portions thereof and identifying the * 
riiinimurri^sicTue^ from and" Ad type D serotype, preferably Ad37, to effect" T^r^^b*" 
selective targeting of a virion packaged with such fiber to photoreceptors in the 

15 eye upon introduction of the packaged virion into the aqueous humor. 

Recombinant adenovirus containing heterologous nucleic acids that 
encode a desired product, such a gene to correct a genetic defect, may be made 
by any methods known to those of skill in the art. The viruses must be 
packaged in a cell line that results in expression of fiber on the particles that 

20 specifically, electively or preferentially targets (binds and results in 

internalization) the viral particle to cells in the eye. The fiber protein from Ad37 
and other Adenoviruses of serotype D that infect the eye effects such targeting. 
The resulting adenovirus particles that express such fiber is administered by 
intraocular injection, subretinal injection, particularly intravitreal injection, or any 

25 means that results in preferential accumulation in photoreceptor cells. 

The family of Adenoviridae includes many members with at least 47 
known serotypes of human adenovirus (Ad1-Ad47) (Shenk, Virology, Chapter 
67, in Fields et al., eds. Lippincott-Raven, Philadelphia, 1 996,) as well as 
members of the genus Mastadenovirus including human, simian, bovine, equine, 

30 porcine, ovine, canine and opossum viruses and members of the Aviadenovirus 
genus, including bird viruses, such as CELO. 
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Thus it is contemplated that the methods herein can be applied to any 
recombinant viral vectors derived from any adenovirus species. One of skill in 
the art would have knowledge of the different adenoviruses (see, e.flr.,Shenk, 
Virology, Chapter 67, in Fields etaL, eds. Lippincott-Raven, Philadelphia, 1996,) 
5 and can construct recombinant viruses containing portions of the genome of any 
such virus. 

In the exemplified embodiment, viral particles with Ad37 fiber were 
prepared. Site-directed mutations were made to the Ad37 fiber gene to make 
the tail sequence more closely match that of Ad 5 to facilitate Ad37 fiber binding 

10 to the Ad5 penton base. The plasmid for the expression of the Ad37 fiber 
protein, pDV80, contains the CMV promoter,, the adenovirus type 5 tripartite 
leader (TPL), and the modified Ad37 fibervgene -sequence. Genes of interest, 
such as nucleic acid encoding the )? subtftiiti'-bf&GMP phosphodiesterase (0PDE), 
^-glucuronidase, rhodopsin, growth factors, anti-cancer agents, growth factor 

15 receptors and other anti-angiogenic agents, and antt-apoptotic agents, can be 
incorporated into these vectors using the methods known to those of skill in the 
art and exemplified herein. 

Known adenovirus vectors, previously constructed for intraocular therapy 
(see, e.g., Bennett etaL (1996) Nature Medicine 2:649-654, which provides an 

20 Ad virus encoding /?PDE for treatment of retinitis pigmentosa; Cayouette et al. 
(1998) Human Gene Therapy 8:423-430, which provides an Ad vector that 
expresses CNTF for treatment of retinitis pigmentosa and other retinal 
degenerative diseases; and Li etaL (1995) Proc. Natl. Acad. ScL U.S.A. 
32:7700-7704, which provides an Ad virus vector that encodes a human ft- 

25 glucuronidase for treatment of lysosomal storage disease caused by /?- 

glucuronidase deficiency) can be modified by repackaging the recombinant 
. genome using a packaging line that expresses an Ad37 fiber or other D serotype 
fiber. 

For exemplification, nucleic acid encoding GFP was incorporated into 
30 th se vectors as a means to visualize their localization. Other genes, such as 
genes that encode therapeutic products, my be included in place of or in addition 
to GFP. 
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Plasmid pDV80 was electroporated into E1-2a S8 cells and stable lines 
were selected. The fiber-deleted vectors Ad5.jffgal.AF and Ad5.GFP.AF were 
grown in ceils in a resulting cell line, designated 705, to produce virions, which 
express the Ad37 fiber (Ad5.£gal.AF/37F and Ad5.GFP.AF/37F) and CsCI- 
5 purified. These virions selectively transduce photoreceptor cells when injected 
intraocularly into the vitreous humor. 

2. Packaging 

Recombinant adenoviral vectors generally have at least a deletion in the 
first viral early gene region, referred to as E1 r which includes the E1a and E1b 

10 regions. Deletion of the viral E1 region renders the recombinant adenovirus 

defective ^(^replication and incapable of producing infectious viral particles in , - 
«. ^^i^^ue^\y-mfected target cells. Thus, to generate E1 -deleted adenovirus 
t y^4genojrie replication and to produce virus particles requires a system of 
complementation which provides the missing E1 gene product. E1 

15 complementation is typically provided by a cell line expressing E1, such as the 
human embryonic kidney packaging cell line, i.e. an epithelial cell line, called 
293. Cell line 293 contains the E1 region of adenovirus, which provides E1 
gene region products to "support" the growth of E1 -deleted virus in the cell line 
(see, e.g., Graham et aL, J. Gen. Virol. 36: 59-71, 1977). Additionally, cell lines 

20 that may be usable for production of defective adenovirus having a portion of 
the adenovirus E4 region have been reported (WO 96/22378). 

Multiply deficient adenoviral vectors and complementing cell lines have 
also been described (WO 95/34671, U.S. Patent No. 5,994,106). 

Copending U.S. application Serial No. 09/482,682 (also filed as 

25 International PCT application No. PCT/US00/00265, filed January 14, 2000)) 
provides packaging cell lines that support viral vectors with deletions of major 
portions of the viral genome, without the need for helper viruses and also 
provides cell lines and helper viruses for use with helper-dependent vectors. 
The packaging cell line has heterologous DNA stably integrated into the 

30 chromosomes of the cellular genome. The heterologous DNA sequence encodes 
one or more aden virus r gulatory and/or structural polypeptides that 
complement the genes deleted or mutated in the adenovirus vector genome to 
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be replicated and packaged. The packaging cell line express, for example, one 
or more adenovirus structural proteins, polypeptides, or fragments thereof, such 
as penton base, hexon, fiber, polypeptide Ilia, polypeptide V, polypeptide VI, 
polypeptide VII, polypeptide VIII, and biologically active fragments thereof. The 
5 expression can be constitutive or under the control of a regulatable promoter. 
These cell lines are designed for expression of recombinant adenoviruses 
intended for delivery of therapeutic products. 

Particular packaging cell lines complement viral vectors having a deletion 
or mutation of a DNA sequence encoding an adenovirus structural protein, 
10 regulatory polypeptides E1A and E1B, and/or one or more of the following 

regulatory proteins or polypeptides: ,E2A,JE2B, E3, E4 r L4, or fragments thereof. 

The packaging cell lines ^are; produced by introducing each DNA molecule 
into the cells and then into the genome 1 via a separate complementing plasmid or 
plurality of DNA molecules encoding the complementing proteins can be 
15 introduced via a single complementing plasmid. Of interest herein, is a variation 
in which the complementing plasmid includes DNA encoding adenovirus fiber 
protein (or a chimeric or modified variant thereof), from Ad virus of subgroup D, 
such as Ad 37, polypeptide or fragment thereof. 

For therapeutic applications, the delivery plasmid further includes a 
20 nucleotide sequence encoding a foreign polypeptide. Exemplary delivery 

plasmids include, but are not limited to, pDV44, pAE1By?-gal and pAE1sp1B. In 
a similar or analogous manner, therapeutic genes may be introduced. 

The cell further includes a complementing plasmid encoding a fiber as 
contemplated herein; the plasmid or portion thereof is integrated into a 
25 chromosome(s) of the cellular genome of the cell. 

In one embodiment, a composition comprises a cell containing first and 
second delivery plasmids wherein a first delivery plasmid comprises an 
adenovirus genome lacking a nucleotide sequence encoding fiber and incapable 
of directing the packaging of new viral particles in the absence of a second 
30 delivery plasmid, and a second delivery plasmid comprises an adenoviral genome 
capable of dir cting the packaging of new viral particles in the presenc of the 
first delivery plasmid. 
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In a variation, the packaging cell line expresses fiber protein or chimeric 
variant thereof from an Ad virus of subgroup D, preferably Ad37, serotype or it 
can be any fiber protein but one that has been modified to include the portion of 
the Ad virus of subgroup D, such as Ad37, responsible for selective targeting to 
5 photoreceptors upon introduction into the vitreous humor of the eye of a 
mammal, preferably a human. The fiber protein can be further modified to 
include a non-native amino acid residue sequence that targets additional specific 
receptors. In all instances, the modification should not disrupt trimer formation 
or transport of fiber into the nucleus. In another variation, the non-native amino 
10 acid residue sequence alters the binding specificity of the fiber for a targeted cell 

'^ype. The structural protein is fiber can include amino acid residue sequences 
^; ^ 'from more than one adenovirus serotype. The nucleotide sequences 1 Wicodihg 
# fiber protein or polypeptide need not be modified solely at one br'bSth- termini; 

fiber protein, may be modified "internally" as well as at the termini. 
15 Additional nucleic acid fragments can encode polypeptides that are added 

to the fiber protein. In one variation, the non-native amino acid residue 
sequence is coupled to the carboxyl terminus of the fiber. In another, the non- 
native amino acid residue sequence further includes a linker sequence. 
Alternatively, the fiber protein further comprises a ligand coupled to the linker. 
20 Suitable ligands include, but are not limited to, ligands that specifically or 

4 selectively bind to a cell surface receptor and ligands that can be used to couple 
other proteins or nucleic acid molecules. Typically, the packaging cell lines will 
contain nucleic acid encoding the fiber protein or modified protein stably 
integrated into a chromosome or chromosomes in the cellular genome. 
25 The packaging cell line can be derived from a procaryotic cell line or from 

a eukaryotic cell line. While various embodiments suggest the use of mammalian 
cells, and more particularly, epithelial cell lines, a variety of other, non-epithelial 
cell lines are used in various embodiments. Thus, while various embodiments 
disclose the use of a cell line selected from among the 293, A549, W1 62, HeLa, 
30 Vero, 211, and 21 1 A cell lines, and any other cell lines suitable for such use ar 
likewise contemplated herein. 

3. C mponents f the nucleic acid molecule included in the particl 
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A recombinant viral vector or therapeutic viral vector for use in the 
methods herein, typically includes a nucleic acid fragment that encodes a 
protein or polypeptide molecule, or a biologically active fragment thereof, or 
other regulatory sequence, that is intended for use in therapeutic applications. 
5 The nucleic acid molecule to be packaged in the viral particle also may 

include an enhancer element and/or a promoter located 3' or 5' to and 
controlling the expression of the therapeutic product-encoding nucleic acid 
molecule if the product is a protein. Further, for purposes herein, the promoter 
and/or other transcriptional and translational regulatory sequences controlling 

10 expression of the product is preferably one that is expressed specifically in the 
targeted cells, such as the a pKotoreceptor-specific promoter, such as a 
rhodopsin gene prombteK^: ^^ *' ' 

The nucleic^aeid Molecule to be packaged in viral capsid includes at least 
2 different operatively linked DNA segments. The DNA can be manipulated and 

15 amplified by PCR as described herein and by using standard techniques, such as 
those described in Molecular Cloning: A Laboratory Manual, 2nd Ed., Sambrook 
et ah, eds., Cold Spring Harbor, New York (1989). Typically, to produce such 
molecule, the sequence encoding the selected polypeptide and the promoter or 
enhancer are operatively linked to a DNA molecule capable of autonomous 

20 replication in a cell either in vivo or in vitro. By operatively linking the enhancer 
element or promoter and nucleic acid molecule to the vector, the attached 
segments are replicated along with the vector sequences. 

Thus, the recombinant DNA molecule (rDNA) is a hybrid DNA molecule 
comprising at least 2 nucleotide sequences not normally found together in 

25 nature. In various preferred embodiments, one of the sequences is a sequence 
encoding an Ad-derived polypeptide, protein, or fragment thereof. The nucleic 
acid molecule intended to be packaged is from about 20 base pairs to about 
40,000 base pairs in length, preferably about 50 bp to about 38,000 bp in 
length. In various embodiments, the nucleic acid molecule is of sufficient length 

30 to encode on or more adenovirus pr teins or functional polypeptid portions 
thereof. Since individual Ad polypeptides vary in length from about 1 9 amin 
acid residues to about 967 amino acid residues, ncoding nucleic acid molecules 
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from about 50 bp up to about 3000 bp, depending on the number and size of 
individual polypeptide-encoding sequences that are "replaced" in the viral 
vectors by therapeutic product-encoding nucleic acid molecules. 

Preferably the molecule includes an adenovirus tripartite leader <TPL) 
5 nucleic acid sequence operatively linked to an intron containing RNA processing 
signals (such as for example, splice donor or splice acceptor sites) suitable for 
expression in the packaging cell line. Most preferably the intron contains a splice 
donor site and a splice acceptor site. Alternatively, the TPL nucleotide sequence 
may not comprise an intron. The intron includes any sequence of 

10 nucleotides that function in the packaging cell line to provide RNA processing 
signals, including splicing signals. Introns have been well characterized from a 
large number of structural genes, and include but ja^e^pt^limited^to a native 
intron 1 from adenovirus, such as Ad5's TPL intrpn pothers include the SV40 
" VP intron; the rabbit beta-globin intron, and synthetic intron constructs (see, 

15 e.g., Petitclerc eta/. (1995W BiothechnoL, 40:169; and Choi etal (19910 MoL 
Cell. Biol., 17:3070). 

The nucleic acid molecule encoding the TPL includes either (a) first and 
second TPL exons or (b) first, second and third TPL exons, where each TPL exon 
in the sequence is selected from among the complete TPL exon 1 , partial TPL 

20 exon 1 , complete TPL exon 2 and complete TPL exon 3. A complete exon is 
one which contains the complete nucleic acid sequence based on the sequence 
found in the wild type viral genome. Preferably the TPL exons are from Ad2, 
Ad3, Ad5, Ad7 and the like, however, they may come from any Ad serotype, as 
described herein. A preferred partial TPL exon 1 is described in the Examples. 

25 The use of a TPL with a partial exon 1 has been reported (International PCT 
application No. WO 98/13499). 

The intron and the TPL exons can be operatively linked in a variety of 
configurations to provide a functional TPL nucleotide sequence. An intron may 
not be a part of the construct. For example, the intron can be positioned 

30 between any of TPL exons 1 , 2 or 3, and the exons can b in any order of first 
and second, or first/second/third. The intron can also be placed preceding th 
first TPL exon or following the last TPL exon. In a pr ferred embodiment. 
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complete TPL exon 1 is operatively linked to complete TPL exon 2 operatively 
linked to complete TPL exon 3. In a preferred variation, adenovirus TPL intron 1 
is positioned between complete TPL exon 1 and complete TPL exon 2. It may 
also be possible to use analogous translational regulators from other viral 
5 systems such as rabiesvirus. 

A preferred "complete" TPL nucleic acid molecule containing complete 
TPL exons 1 , 2 and 3 with adenovirus intron 1 inserted between exons 1 and 2 
has a nucleotide sequence shown in SEQ ID NO: 32. A preferred "partial" TPL 
nucleic acid molecule containing partial TPL exon 1 and complete TPL exons 2 
10 and 3 in that order has a nucleotide sequence shown in SEQ ID NO: 26. The 
construction of thesenpref erred TPL nucleotide sequences is described in the 
Examples, r. \ " :,r " ■• ** # -J 5i -- 

' ThU&^reffcrred expression cassettes and complementing plasmids for W-^5v^* 
expressing adenovirus structural genes, particularly fiber protein, contain an 
15 adenovirus TPL nucleotide sequence as described herein. 
4. Complementing Plasmids 

Also contemplated are the use of nucleic acid molecules, typically in the 
form of DNA plasmid vectors, which are capable of expression of an adenovirus 
structural protein or regulatory protein. Because these expression plasmids are 
20 used to complement the defective genes of a recombinant adenovirus vector 
genome, the plasmids are referred to as complementing or complementation 
plasmids. 

The complementing plasmid contains an expression cassette, a nucleotide 
sequence capable of expressing a protein product encoded by the nucleic acid 

25 molecule. Expression cassettes typically contain a promoter and a structural 

gene operatively linked to the promoter. The complementing plasmid can further 
include a sequence of nucleotides encoding TPL nucleotide to enhance 
expression of the structural gene product when used in the context of 
adenovirus genome replication and packaging. 

30 A complementing plasmid can include a promoter operatively linked to a 

s quence of nucleotides encoding an adenovirus structural p lypeptide, such as, 
but are not limited to, penton base; hexon; fiber; polypeptide Ilia; polypeptide V; 
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polypeptide VI; polypeptide VII; polypeptide VIII; and biologically active 
fragments thereof. In another variation, a complementing plasmid may also 
include a sequence of nucleotides encoding a first adenovirus regulatory 
polypeptide, a second regulatory polypeptide, and/or a third regulatory 
5 polypeptide, and any combination of the foregoing. 

Plasmid pDV80 is a preferred plasmid herein. Other plasmids constructed 
in an analogous manner to encode modified fiber proteins and chimeric fiber 
proteins are also contemplated herein. 

5. Nucleic Acid Molecule Synthesis 

10 A nucleic acid molecule comprising synthetic oligonucleotides can be 

prepared using any suitable method, such as the phosphotriester or 
phosphodiester methods (see, e.g., frararig-Mi'979) ei aL, Meth. EnzymoL, 
68:90; U.S. Patent No. 4,356,270; an'd : ; B>oWrf M a/., (iQlB) Meth. EnzymoL, 
68:109). For oligonucleotides, the synthesis of the family members can be ' 

1 5 conducted simultaneously in a single reaction vessel, or can be synthesized 
independently and later admixed in preselected molar ratios. For simultaneous 
synthesis, the nucleotide residues that are conserved at preselected positions of 
the sequence of the family member can be introduced in a chemical synthesis 
protocol simultaneously to the variants by the addition of a single preselected 

20 nucleotide precursor to the solid phase oligonucleotide reaction admixture when 
that position number of the oligonucleotide is being chemically added to the 
growing oligonucleotide polymer. The addition of nucleotide residues to those 
positions in the sequence that vary can be introduced simultaneously by the 
addition of amounts, preferably equimolar amounts, of multiple preselected 

25 nucleotide precursors to the solid phase oligonucleotide reaction admixture 
during chemical synthesis. For example, where all four possible natural 
nucleotides (A,T,G and C) are to be added at a preselected position, their 
precursors are added to the oligonucleotide synthesis reaction at that step to 
simultaneously form four variants (see, e.g., Ausubel et aL (Current Protocols in 

30 Molecular Biology, Suppl. 8. p.2.1 1 .7, John Wiley & Sons, Inc., N w York 
,1991). 
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Nucleotide bases other than the common four nucleotides (A,T,G or C), or 
the RNA equivalent nucleotide uracil (U), can also be used. For example, it is 
well known that inosine (I) is capable of hybridizing with A, T and G, but not C. 
Examples of other useful nucleotide analogs are known in the art and may be 
5 found referred to in 37 C.F.R. §1.822. 

Thus, where all four common nucleotides are to occupy a single position 
of a family of oligonucleotides, that is, where the preselected nucleotide 
sequence is designed to contain oligonucleotides that can hybridize to four 
sequences that vary at one position, several different oligonucleotide structures 
10 are contemplated. The composition can contain four members, where a 

preselected position contains A,T,G or C. Alternatively, a composition can ,* 
ri'i 'contain two nucleotide sequence members, where a preselected positions- ^ . ^ 
; ^r?5cohtains I or C, and has the capacity to hybridize at that position td^ll^urf^^ 

possible common nucleotides. Finally, other nucleotides may be included at the 
1 5 preselected position that have the capacity to hybridize in a non-destabilizing 
manner with more than one of the common nucleotides in a manner similar to 
inosine. 

Similarly, larger nucleic acid molecules can be constructed in synthetic 
oligonucleotide pieces, and assembled by complementary hybridization and 
20 ligation, as is well known. 

D. Adenovirus Expression Vector Systems 

The adenovirus vector genome that is encapsulated in the virus particle 
and that expresses exogenous genes in a gene therapy setting is a key 
component of the system. Thus, the components of a recombinant adenovirus 
25 vector genome include the ability to express selected adenovirus structural 
genes, to express a desired exogenous protein, and to contain 

sufficient replication and packaging signals that the genome is 
packaged into a gene delivery vector particle. The preferred 
replication signal is an adenovirus inverted terminal repeat 
30 containing an adenovirus origin of replication, as is well known 

and described her in. 
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Although adenovirus include many proteins, not all adenovirus proteins 
are required for assembly of a recombinant adenovirus particle (vector). Thus, 
deletion of the appropriate genes from a recombinant Ad vector permits 
accommodation of even larger "foreign" DNA segments. 
5 A preferred recombinant adenovirus vector genome is "helper 

independent" so that genome can replicate and be packaged without the help of 
a second, complementing helper virus. Complementation is provided by a 
packaging cell. 

In a preferred embodiment, the adenovirus vector genome does not 
10 encode a functional adenovirus fiber protein. A non-functional fiber gene refers 
to a deletion, mutation or other modification to the adenovirus fiber gene such 
that the gene does not express ahy or insufficient adenovirus fiber protein to 
package a fiber-containing adenovirus particle without complementation of the 
fiber gene by a complementing plasmid or packaging cell line. Such a genome is 
15 referred to as a "fiberless" genome, not to be confused with a fiberless particle. 
Alternatively, a fiber protein may be encoded but is insufficiently expressed to 
result in a fiber containing particle. 

Thus, contemplated for use are helper-independent fiberless recombinant 
adenovirus vector genomes that include genes that (a) express all adenovirus 
20 structural gene products but express insufficient adenovirus fiber protein to 

package a fiber-containing adenovirus particle without complementation of said 
fiber gene, (b) express an exogenous protein, and (c) contain an adenovirus 
packaging signal and inverted terminal repeats containing adenovirus origin of 
replication. 

25 The adenovirus vector genome is propagated in the laboratory in the form 

■ of rDNA plasmids containing the genome, and upon introduction into an 
appropriate host, the viral genetic elements provide for viral genome replication 
and packaging rather than plasmid-based propagation. Exemplary methods for 
preparing an Ad-vector genome are described in the Examples. 

30 A vector her in includes a nucleic acid (preferably DNA) molecule capable 

of auton mous replication in a cell and to which a DNA segment, e.g., a gene or 
polynucleotide, can b operatively linked to bring about replication of the 
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attached segment. For purposes herein, one of the nucleotide segments to be 
operatively linked to vector sequences encodes at least a portion of a therapeutic 
nucleic acid molecule. As noted above, therapeutic nucleic acid molecules 
include those encoding proteins and also those that encode regulatory factors 
5 that can lead to expression or inhibition or alteration of expression of a gene 
product in a targeted cell. 

1 . Nucleic Acid Gene Expression Cassettes 
In various embodiments, a peptide-coding sequence of the therapeutic 
gene is inserted into an expression vector and expressed; however, it is also 

10 feasible to construct an expression vector which also includes some non-coding 
sequences as well. Preferably, however, non-coding sequences are excluded. 
Alternatively, a nucleotide sequence for a soluble form'of ^polypeptide may be 
utilized. Another preferred therapeutic viral vector includ r es;#hucleotide 
sequence encoding at least a portion of a therapeutic nucleotide sequence 

1 5 operatively linked to the expression vector for expression of the coding sequence 
in the therapeutic nucleotide sequence. 

The choice of viral vector into which a therapeutic nucleic acid molecule 
is operatively linked depends directly, as is well known in the art, on the 
functional properties desired, e.g., vector replication and protein expression, and 

20 the host cell to be transformed - these being limitations inherent in the art of 
constructing recombinant DNA molecules. Although certain adenovirus 
serotypes are recited herein in the form of specific examples, it should be 
understood that the use of any adenovirus serotype, including hybrids and 
derivatives thereof are contemplated. 

25 A translatable nucleotide sequence is a linear series of nucleotides that 

provide an uninterrupted series of at least 8 codons that encode a polypeptide in 
one reading frame. Preferably, the nucleotide sequence is a DNA sequence. The 
vector itself may be of any suitable type, such as a viral vector (RNA or DNA), 
naked straight-chain or circular DNA, or a vesicle or envelope containing the 

30 nucleic acid material and any polypeptides that ar to be inserted into the c II. 
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2. Promoters 

As noted elsewhere herein, an expression nucleic acid in an Ad-derived 
vector may also include a promoter, particularly a tissue or cell specific 
promoter, preferably one expressed in ocular cells, particularly photoreceptors. 
5 Promoters contemplaged for use herein include regulatable (inducible) as 

well as constitutive promoters, which may be used, either on separate vectors or 
on the same vector. Some useful regulatable promoters are those of the CREB- 
regulated gene family and include inhibin, gonadotropin, cytochrome c, 
glucagon, and the like. (See, e.g., International PCT application No. WO 

10 96/14061 ). Preferably the promoter selected is from a photoreceptor-specific 
gene, such as a rhodopsin.|gepe or gene that encodes a protein that regulates 
rhodopsin exprejs^qn^v ^ : 
E. Formulation and administration 

Compositions containing therapeutically effective concentrations of 

15 recombinant adenovirus delivery vectors are provided. These are for delivery of 
therapeutic gene products to cells, particularly cells express a particular 50 kDa 
receptor or other receptor with which the vectors interact. These cells include 
cells of the eye and genital tract. Of particular interest are photoreceptor cells of 
the eye. Administration is effected by any means through which contacting with 

20 the photoreceptors is effected. Preferable modes of administration include, but 
are not limited to, subretinal injection, particularly intravitreal injection, to provide 
access to photoreceptor cells. 

The recombinant viral compositions may also be formulated for 
implantation into the anterior or posterior chamber of the eye, preferably the 

25 vitreous cavity, in sustained released formulations, such as those adsorbed to 
biodegradable supports, including collagen sponges, or in liposomes. Sustained 
release formulations may be formulated for multiple dosage administration, so 
that during a selected period of time, such as a month or up to about a year, 
several dosages are administered. Thus, for example, liposomes may be 

30 prepared such that a total of about tw to up to about fiv or more times the 
single dosage is administ red in one injection. 
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The vectors are formulated in an ophthalmologically acceptable carrier for 
intraocular, preferably intravitreal, administration in a volume of between about 
0.05 ml and 0.150 ml, preferably about 0.05 and 0.100 ml. 

The composition can be provided in a sealed sterile vial containing an 
5 amount of a compound of formula I, that upon intraocular administration will 
deliver a sufficient amount of viral particles to the photoreceptors in a volume of 
about 50 to 1 50 //I, containing at least about 10 7 , more preferably at least "about 
10 s plaque forming units in such volume. Typically, the vials will, thus, contain 
about 0.150 ml of the composition. 
10 To prepare compositions the viral particles are dialzyed into a suitable 

ophthalmologically acceptable carrier or viral particles, for example, may be 
^ u concentrated and/or mixed therewith. The resulting mjxture may be a solution, 

^a^v* , suspension or emulsion. In addition, the v\i& particles may be fdrmulated as 

the sole pharmaceutical^ active ingredient in the composition or may be 
15 combined with other active agents for the particular disorder treated. 

For administration by intraocular injection or via eyedrops, suitable 
carriers include, but are not limited to, physiological saline, phosphate buffered 
saline (PBS), balanced salt solution (BSS), lactate Ringers solution, and solutions 
containing thickening and solubilizing agents, such as glucose, polyethylene 
20 glycol, and polypropylene glycol and mixtures thereof. Liposomal suspensions 
may also be suitable as pharmaceutically acceptable carriers. These may be 
prepared according to methods known to those skilled in the art. Suitable 
ophthalmologically acceptable carriers are known. Solutions or mixtures 
intended for ophthalmic use may be formulated as 0.01% - 10% isotonic 
25 solutions, pH about 5-7, with appropriate salts [see, e.g., U.S. Patent No. 
5,1 16,868, which describes typical compositions of ophthalmic irrigation 
solutions and solutions for local application]. Such solutions, which have a pH 
adjusted to about 7.4, contain, for example, 90-100 mM sodium chloride, 4-6 
mM dibasic potassium phosphate, 4-6 mM dibasic sodium phosphate, 8-12 mM 
30 sodium citrate, 0.5-1 .5 mM magnesium chloride, 1 .5-2.5 mM calcium chloride, 
15-25 mM sodium acetate, 10-20 mM D.L.-sodium £-hydroxybutyrate and 5-5.5 
mM glucose. 
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The compositions may be prepared with carriers that protect them from 
rapid elimination from the body, such as time release formulations or coatings. 
Such carriers include controlled release formulations, such as, but not limited to, 
microencapsulated delivery systems, and biodegradable, biocompatible polymers, 
5 such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, 

polyorthoesters, polylactic acid and other types of implants that may be placed 
directly into the anterior or posterior chamber or vitreous cavity of the eye. The 
compositions may also be administered in pellets, such as Elvax pellets 
{ethylene-vinyl acetate copolymer resin). 

10 Liposomal suspensions, including tissue-targeted liposomes, may also be 

suitable as pharmaceutical^ acceptable carriers. For example, liposome 
formulations j may be prepared by methods known to those of skill in the art |seev.i v**, 

4 e*g?; >Kimm et al. (1983V Bioch. Bioph. Acta 728 :339-398: Assil et aL x (1987^^>^ 
Arch Ophthalmol. 105 :400: and U.S. Patent No. 4,522,81 1]. The viral particles 

15 may be encapsulated into the aqueous phase of liposome systems. 

The active materials can also be mixed with other active materials, that 
do not impair the desired action, or with materials that supplement the desired 
action or have other action, including viscoelastic materials, such as hyaluronic 
acid, which is sold under the trademark HEALON, which is a solution of a high 

20 molecular weight (MW) of about 3 millions fraction of sodium hyaluronate 
[manufactured by Pharmacia, Inc; see, e.g.. U.S. Patent Nos. 5,292,362, 
5,282,851, 5,273,056, 5,229,127, 4,517,295 and 4,328,803], VISCOAT 
[fluorine-containing (meth)acrylates, such as, 1H,1H,2H,2H-hepta- 
decafluorodecylmethacrylate; see, e.g. . U.S. Patent Nos. 5,278,126, 5,273,751 

25 and 5,214,080; commercially available from Alcon Surgical, Inc.], ORCOLON 
[see, e.g.. U.S. Patent No. 5,273,056; commercially available from Optical 
Radiation Corporation], methylcellulose, methyl hyaluronate, polyacrylamide and 
polymethacrylamide [see, e.g.. U.S. Patent No. 5,273,7511. The viscoelastic 
materials are present generally in amounts ranging from about 0.5 to 5.0%, 

30 pr ferably 1 t 3% by weight of the conjugate material and serve to coat and 
protect th treated tissues. The compositions may also include a dye, such as 



WO 01/83729 



PCT/EP01/04863 



-38- 

methylene blue or other inert dye, so that the composition can be seen when 
injected into the eye. Additional active agents may be included. 

The compositions can be enclosed in ampules, disposable syringes or 
multiple or single dose vials made of glass, plastic or other suitable material. 
5 Such enclosed compositions can be provided in kits. In particular, kits 

containing vials, ampules or other containers, preferably disposable vials with 
sufficient amount of the composition to deliver about 0.100 ml thereof, and 
disposable needles, preferably self sealing 25-30 gauge needles, are provided 
herein. 

10 Finally, the compounds may be packaged as articles of manufacture 

containing packaging material, typically a vial,^an ophthalmologically acceptable 
composition containing the virahparjicles^and a label that indicates the 
therapeutic use of the compositron#>^£ < fe ,J * " 

Also provided are kits for practice of the methods herein. The kits 

1 5 contain one or more containers, such as sealed vials, with sufficient composition 
for single dosage administration, and one or more needles, such as self sealing 
25-33 gauge needles, preferably 33 gauge or smaller needles, precisely 
calibrated syringes or other precisely calibrated delivery device, suitable for 
intravitreal injection. 

20 Administration of the composition is preferably by intraocular injection, 

although other modes of administration may be effective, if the sufficient 
amount of the compound achieves contact with the vitreous cavity. Intraocular 
injection may be effected by intravitreal injection, aqueous humor injection or 
injection into the external layers of the eye, such as subconjunctival injection or 

25 subtenon injection, or by topical application to the cornea, if a penetrating 
formulation is used. 
Administration 

The compositions containing the compounds are administered 
intraocularly or by other means, such as topically in the form of penetrating 
30 eyedrops, whereby contact f the recombinant v ctors with the aqueous humor 
is ffected. Intraocular administration may be effected by intravitreal inj ction, 
aqueous humor injection, injection into the external layers of the eye, such as 
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subconjunctival injection or subtenon injection, preferably in free form, but, 
alternatively, in liposomes or other sustained drug delivery device. 
Administration is preferably by intravitreal injection, preferably through self 
sealing 25-30 gauge needles or other suitably calibrated delivery device. 
5 Injection into the eye may be through the pars plana via the self-sealing needle. 

It is further understood that, for any particular subject, specific dosage 
regimens should be adjusted over time according to the individual need and the 
professional judgment of the person administering or supervising the 
administration of the recombinant viruses, and that the concentration ranges set 
10 forth herein are exemplary only and are not intended to limit the scope or 

practice of the claimed methods 4 ,<rw2 

u Oi-:^\iCt\; ■ F. " Diseases, Disorders and therapeutic products * * a?"* ;» * 

i|»^^A-.^> 1 . Disease and disorders ^y-'^' ;t >^ 

Retinitis pigmentosa 

1 5 Methods for specifically or selectively targeting recombinant adenovirus 

vectors for delivery of gene products, particularly therapeutic products are 
provided herein. These methods are particularly suitable for targeting cells that 
express receptors that are selectively recognized by Ad virus of subgroup D 
viruses, particularly Ad37. It is shown herein that these viruses selectively 

20 recognize receptors on cells, such as conjunctival cells and photoreceptors, that 
are not recognized by other adenoviruses. Hence, methods for targeting to 
these cell types by providing vectors that are packaged in viral particles that 
contain a sufficient portion of a fiber protein from one of these Ad serotypes to 
bind to these receptors. These methods are useful for targeting to 

25 photoreceptors and for treating ocular disorders, including, but are not limited to, 
inherited and acquired retinal, neovascular degenerative diseases (see table 
below). 

It is estimated that 1 in 3,500 individuals in the United States suffer from 
one of the pigmented retinopathies. This group of retinal diseases, commonly 
30 call d retinitis pigmentosa, is characterized by progressive loss of peripheral and 
night vision. Patients may be affected at almost any age and it is not 
uncommon to experienc symptoms in early childhood in certain inherited forms. 
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It has been shown that there are a variety of mutations in genes expressed in 
the photoreceptors, including genes in the rhodopsin gene and pathway that 
appear to be responsible for these diseases. In addition to mutations in 
rhodopsin, changes in the retinal pigmented epithelial (RPE) cells, also undergo 
5 degenerative changes and can form clumps of pigment that give rise to the 
characteristic pigmentary changes seen in patients with RP. 

Angiogenesis and ocular diseases and disorders 
The vast majority of diseases that cause catastrophic loss of vision do so 
as a result of ocular neovascularization; age related macular degeneration 

10 (ARMD) affects 12-15 million American over the age of 65 and causes visual 
loss in 10-15% of them as a direct-effect of choroidal (sub-retinal) 
neovascularization. Thesleading cause of visual loss for Americans under the 
age of 65 is didbeteS;^T.6 ? milli'6n individuals in the United States are diabetic and 
40,000 per year suffer from ocular complications of the disease, which often are 

15 a result of retinal neovascularization. Laser photocoagulation has been effective 
in preventing severe visual loss in subgroups of high risk diabetic patients, but 
the overall 10 year incidence of retinopathy remains essentially unchanged. For 
patients with choroidal neovascularization due to ARMD or inflammatory eye 
disease, such as ocular histoplasmosis, photocoagulation, with few exceptions, 

20 is ineffective in preventing visual loss. While recently developed, non- 
destructive photodynamic therapies hold promise for temporarily reducing 
individual loss in patients with previously untreatable choroidal 
neovascularization, only 61 .4% of patients treated every 3-4 months had 
improved or stabilized vision compared to 45.9% of the placebo-treated group. 

25 In the normal adult, angiogenesis is tightly regulated and limited to wound 

healing, pregnancy and uterine cycling. Angiogenesis is turned on by specific 
angiogenic molecules such as basic and acidic fibroblast growth factor (FGF), 
vascular endothelial growth factor IVEGF), angiogenin, transforming growth 
factor (TGF), tumor necrosis factor-cr (TNF-ff) and platelet derived growth factor 

30 (PDGF). Angiogenesis can be suppressed by inhibitory molecules such as 

interferon-a, thrombospondin-1 , angiostatin and endostatin. It is the balance of 
these naturally occurring stimulators and inhibitors that controls the normally 
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quiescent capillary vasculature. When this balance is upset, as in certain disease 
states, capillary endothelial cells are induced to proliferate, migrate and 
ultimately differentiate. 

Angiogenesis plays a central role in a variety of diseases, including, but 
5 are not limited to, cancer and ocular neovascularization. Sustained growth and 
metastasis of a variety of tumors has also been shown to be dependent on the 
growth of new host blood vessels into the tumor in response to tumor derived 
angiogenic factors. Proliferation of new blood vessels in response to a variety of 
stimuli occurs as the dominant finding in the majority of eye diseases that blind, 

10 such as, but are not limited to, proliferative diabetic retinopathy (PDR), ARMD, 
rubeotic glaucoma, interstitial keratitis and retinopathy of prematurity. In these 
diseases, 0 tissue damage can stimulate release of angiogenic factors resulting in 
capillary proliferation. VEGF plays a dominant role in'iris neovascularization and 
neovascular retinopathies. While reports clearly show a correlation between 

15 intraocular VEGF levels and ischemic retinopathy ocular neovascularization, FGF 
likely plays a role. Basic and acidic FGF are known to be present in the normal 
adult retina, even though detectable levels are not consistently correlated with 
neovascularization. This may be largely due to the fact that FGF binds very 
tightly to charged components of the extracellular matrix and may not be readily 

20 available in a freely diffusible form that would be detected by standard assays of 
intraocular fluids. 

A final common pathway in the angiogenic response involves integrin- 
mediated information exchange between a proliferating vascular endothelial cell 
and the extracellular matrix. This class of adhesion receptors, called integrins, 

25 are expressed as heterodimers having an a and ft subunit on all cells. One such 
integrin, aji z , is the most promiscuous member of this family and allows 
endothelial cells to interact with a wide variety of extracellular matrix 
components. Peptide and antibody antagonists of this integrin inhibit 
angiogenesis by selectively inducing apoptosis of the proliferating vascular 

30 endothelial cells. Tw cytokine-dependent pathways of angiogenesis exist and 
may be defined by their dependency on distinct vascular cell integrins, aj? 3 and 
cgP 5 . Specifically, basic FGF- and VEGF-induc d angiogenesis depend on integrin 
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aJ3 3 and cg? 5 , respectively, since antibody antagonists of each integrin 
selectively block one of these angiogenic pathways in the rabbit corneal and 
chick chorioallantoic membrane (CAM) models. Peptide antagonists that block 
all or v integrins inhibit FGF- and VEGF-stimulated angiogenesis. While normal 
5 human ocular blood vessels do not display either integrin, aj9 3 and aji s integrins 
are selectively displayed on blood vessels in tissues from patients with active 
neovascular eye disease- While only aJ3 z was consistently observed in tissue 
from patients with ARMD,agff 3 and a^ 5 were present in tissues from patients 
with PDR. Systemically administered peptide antagonists of integrins blocked 

10 new blood vessel formation in a mouse model of retinal vasculogenesis. 

In addition to ajd^esion events described above, cell migration through the 
extracellular .matrix, also depends on proteolysis. Matrix metalloproteinases are a 1 
family of, zinc-requiring matrix-degrading enzymes that include the collagenases, J ?x&* 
gelatinases and stromelysins, all of which have been implicated in invasive cell 

15 behavior. Invasive cell processes such as tumor metastasis and angiogenesis 
have been found to be associated with the expression of integrins and MMP-2, 
MMP-2 are all found throughout the eye where they may interact to maintain a 
quiescent vasculature until the balance is upset, resulting in pathological 
angiogenesis. A non-catalytic C-terminal hemopexin-like domain of MMP-2 

20 (PEX) can block cell surface collagenolytic activity and inhibit angiogenesis in the 
CAM model by preventing localization of MMP-2 to the surface of invasive cells 
through interaction with the integrin ajff 3 . 

Hence, anti-angiogenic agents have a role in treating retinal degeneration 
to prevent the damaging effects of these trophic and growth factors. 

25 Angiogenic agents, also have a role in promoting desirable vascularization to 
retard retinal degeneration by enhancing blood flow to cells. 

Members of adenovirus subgroup D, Ad8, 19 A, and 37, are infectious 
agents that cause particularly severe cases of epidemic keratoconjunctivitis 
(EKC) (Arnberg eta/. (1998) Virology 227:239-244; Curtis etaL (1998) 

30 J.Med.MicrobioL 47:91-94; Ritterband etaL (1998) Rev.Med.ViroL &1 87-201; 
and Takeuchi etaL (1999) J. Clin. Micro bio L 37:3392-3394). There is no 
eff ctiv treatment for this debilitating and contagious diseas and EKC 



WO 01/83729 



PCT/EP01/04863 



-43- 



continues to be a problem in ophthalmology clinics worldwide (Curtis et af. 
(1998) J.Med.Microbiol. 47:91-94, Lukashok etal. (1998) 
Curr.Clin.Top.lnfec.Dis. 75:286-304). Hence the vectors herein may be used for 
treating the disease. 

Table 3 

Candidate targets for ocular disease therapy 



CANDIDATE TARGETS FOR OCULAR DISEASE THERAPY 


Disease 


Candidate target(s) 


Retinitis pigmentosa 


Rhodopsin gene, and genes that 
regulate expression thereof 
rcfe/peripherin 


Stargardt's disease 


rim protein (ARC protein) 






Choroideremia 


rab geranylgeranyl transferase 
CHM, TCD, CHML* 


Gyrate Atrophy 


ornithine aminotransferase 


Macular dystrophy 


ycfe/peripherin 



* see, "MSR6-yeast homologue of the choroideraemia gene," Nature Genetics 
3:193-4(1993) 

TABLE 4 



Other Diseases 



Exudative Choroidal Diseases 



ICSC; fluorescein angiogram 



ICSC with large serious detachment of RPE (retinal pigmented epithelium) 



ICSC with bullous retinal detachment 



Macular drusen, exudative, confluent 



Drusen, sub-RPE choroidal neovascularization 



Drusen, notched serous detachment of RPE 
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Other Diseases 



Drusen, notched serous and hemorrhagic detachment of RPE 



Drusen, serous and hemorrhagic detachment of RPE and retina 



Drusen, organized RPE detachment causing bullous retinal detachment 



Drusen, geographic atrophy of RPE 



Drusen, exudative and cuticular, vitellif orm macular detachment 



Drusen, cuticular, large vitelliform macular detachment 



North Carolina dystrophy with macular staphyloma 



North Carolina dystrophy with macular staphyloma 



Ahgioid streaks, pseudoxanthoma elasticum (PXE), CNVM 



Angioid streaks, PXE, large notched retinal detachment ■ ji ap 



Myopic degeneration, Foerster-Fuchs spot - S 



Presumed ocular histoplasmosis syndrome (POHS) 



Submacular bacterial abscess 



Toxocara cam's, subretinal granuloma 



Serpiginous (geographic) choroiditis 



Posterior scleritis 



Harada's disease 



Posterior sympathetic uveitis 



Benign reactive lymphoid hyperplasia of uveal tract 



Choroidal ruptures and CNVM 



Cavernous hemangioma of choroid 



Choroidal osteoma 



Choroidal nevus, serous macular detachment 



Choroidal nevus with CNVM 



Diffuse sclerochoroidal melanocytic nevus 



Choroidal melanoma with serous detachment of RPE 



M tastatic lung carcinoma to choroid 



Sub-RPE reticulum cell sarcoma 



WO 01/83729 PCT/EP01/04863 



-45- 



Other Diseases 



RPE tear, idiopathic choroidal neovascularization 



Heredodystrophic Disorders Affecting RPE & Retina 



Best's vitelliform macular dystrophy 



Best's vitelliform macular dystrophy with CNVM 



Best's vitelliform macular dystrophy, multiple lesions 



Adult-onset vitelliform f oveomacular dystrophy 



Pattern dystrophy simulating fundus flavimaculatus 



Stargardt's disease (fundus flavimaculatus) 



Asteroid macular dystrophy 



10 



Sjogren-Larssen syndrome 



Oguchi's disease, light-adapted, state k 



Oguchi's disease, dark-adapted state 



Fundus albipunctatus 



Retinitis pigmentosa, cystoid macular edema 



15 



Crystalline tapetoretinal dystrophy 



Choroideremia 



Goldmann-Favre syndrome 



Sex-linked juvenile retinoschisis 



Perivenous retinitis pigmentosa 



20 



Retinal Vascular Disorders 



Retinal arteriovenous aneurysm 



Central retinal artery occlusion 



Cilioretinal artery obstruction 



Ischemic retinopathy in systemic lupus erythematosus 



25 



Ischemic retinopathy in scleroderma 



Hemorrhagic detachment of internal limiting membrane, hypertensive 
retinopathy 



Acquired retinal arterial macroaneurysm 
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Other Diseases 



Cystoid macular edema, aphakic 



Cystoid macular edema, nicotinic acid maculopathy 



Congenital retinal telangiectasis 



Acquired bilateral juxtafoveal telangiectasis 



Acquired bilateral juxtafoveal obliterative telangiectasis 



Diabetic optic neuropathy 



X-ray radiation exudative retinopathy 



Sickle cell SC disease, macular hemorrhage 



Retinal arterial aneurysms, arteritis, neuroretinitis -fffi 



Branch retinal vein obstruction (BRVO) 



BRVO, exudative maculopathy 



BRVO, optic disc new vessels, photocoagulation 



Waldenstrom's macroglobulinemia 



Inflammatory Diseases of the Retina and Choroid 



Luetic retinal vasculitis 



Focal Candida retinal abscess 



Toxoplasmosis, atrophic chorioretinal scar 



Toxoplasmosis retinitis and macular detachment 



Toxoplasmosis scar, CNVM, macular detachment 



Diffuse unilateral subacute neuroretinitis, small worm 



Diffuse unilateral subacute neuroretinitis, large worm 



Cytomegalic inclusion disease, papillitis 



Acute posterior multifocal placoid pigment epitheliopathy 



Acute macular neuroretinitis 



Sarcoid retinitis 



Sarcoid papillitis 



Behcet's disease 



Vitiliginous (bird-shot) chorioretinitis 
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Other Diseases 



Multifocal choroiditis and panveitis (pseudo-POHS) 



Retinal and Pigment Epithelial Hamartomas 



Congenital grouped albinotic RPE spots 



Congenital hyperplasia of RPE 



Combined RPE and retinal hamartoma, juxtapapillary 



Combined RPE and retinal hamartoma, peripheral 



Cystic astrocytoma, juxtapapillary 



Astrocytoma, macula 



Astrocytoma, juxtapapillary 



Cavernous hemangioma; joHetina 



Juxtapapillary sessile' retinal capillary hemangioma 



Juxtapapillary endophytic retinal capillary hemangioma 



Other Tumors of the Choroid 



Choroidal metastasis 



Choroidal osteoma 



Choroidal hemangioma 



Miscellaneous uveal tumors 



Intraocular Lymphoid Tumors 



The leukemias and lymphomas 



Tumors of the Vitreous 



Non-Hodgkins ("reticulum cell") lymphoma 



Tumor involvement of the vitreous cavity 



Macular Disease 



Age-related macular degeneration — atrophic form 



Exudative age-related macular degeneration 



Choroidal neovascular membrane in degenerative myopia 



C ntral serous retinopathy 



Macular hole 
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Other Diseases 



Macular dystrophies 



Retinal Vascular Disease 



Etiologic mechanisms in diabetic retinopathy 



Background diabetic retinopathy 



Proliferative diabetic retinopathy 



Retinal arterial obstructive disease 



Central retinal vein occlusion 



Retinal branch vein occlusion 



Pregnancy and retinal disease 



Pregnancy-induced hypertension ' v.-n 



Hypertension 



The rheumatic disease 



Parafoveal telangiectasis 



Coats disease 



Disseminated intravascular systemic coagulopathy and related vasculopathies 



Hemoglobinopathies 



Retinopathy of prematurity 



Acquired retinal macroaneurysms 



Eales disease 



Radiation retinopathy 



The ocular ischemic syndrome 



Inflammatory Disease 



Ocular toxoplasmosis 



Ocular toxocariasis 



Ocular cysticercosis 



Cytomegalovirus infections f the retina 



Retinal and ophthalmologic manifestations of AIDS 



Acute retinal n crosis syndrome 
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Other Diseases 



Endogenous fungal infections of the retina and choroid 



Pars planitis 



Syphilis and tuberculosis 



Diffuse unilateral subacute neuroretinitis 



Scleritis 



Birdshot retinochoroidopathy 



Punctate inner choroidopathy 



Sarcoidosis 



Acute multifocal placoid pigment epitheliopatby 



10 



Geographic helicoid peripapillary choroidopathy (GHPC): serpiginous 
choroiditis^-^ :^rW" ' 



Sympathetic ophthalmia 



Vogt-Koyanigi-Harada syndrome (uveomeningitic syndrome) 



Ciliochoroidal (uveal) effusion 



15 



Reproduced from: Stereoscopic Atlas of Ocular Diseases Diagnosis and Treat- 
ment, 2nd Edition, J. Donaid O. Gass, Vol. 1 & 2, C.V. Mosley Co. (1987); 
and Retina Vol. II, Editor, Stephen J. Ryan, Medical Retina, C.V. Moslay Co. 
(1989). 

20 2. Therapeutic products 

Therapeutic products include but are not limited to, wild-type genes that 
are defective in ocular disorders, such as rhodopsin, or fragments thereof 
sufficient to correct the genetic defect, trophic factors, including growth factors, 
inhibitors and agonists of trophic factors, anti-apoptosis factors and other 

25 products described herein or known to those of skill in the art to be useful for 
treatment of disorders of the eye or that can be treated by a product expressed 
by a photoreceptor. 



OCULAR GENE THERAPY STRATEGIES 


GENERAL DISEASE 


EXAMPLES 


STRATEGY 
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OCULAR GENE THERAPY STRATEGIES 


Hereditary retinal and 
macular degeneration 


• Retinitis pigmentosa 

• Stargardt's disease 

• Other macular 
dystrophies 


Growth factors (e.g., 
GDNF) 

anti-apoptotic factors 
(e.g., bcl2 gene) 
Stargardt Disease Gene 
(ABCR) t 


Neovascular 


• Diabetes 

• Choroidal 
neovascularization 


Anti-angiogenesis 
factors 


Anti-tumor 


Retinoblastoma 


Antiproliferant 


Glaucoma 


Nerve fiber layer 
atrophy , 


Neuroprotective agent 



v See Allikmets etal. (1997) Science 277:1805-1807;. ~ " : 

For example, for treatment of retinitis^pigmentosa the adenovirus vector 

can deliver a wild-type rhodopsin gene or a grbwtlf f actor or trophic factor, such 

as ciliary neurotrophic factor CNTF; for treatment of Stargardt's disease, the 



10 vector can deliver a wild type ABCR (also called STGD1) or a growth factor or 
anti-angiogenic agent; for diabetic retinopathies, retinal vascularization the 
vector can deliver growth factors, such as a TGF (TGF/?), to prevent 
degeneration. 

15 The following examples are included for illustrative purposes only and are 

not intended to limit the scope of the invention. 

EXAMPLE 1 

Preparation of Adenovirus Packaging Cell Lines 

Cell lines that are commonly used for growing adenovirus are useful as 

20 host cells for the preparation of adenovirus packaging cell lines. Preferred cells 
include 293 cells, an adenovirus-transformed human embryonic kidney cell line 
obtained from the ATCC, having Accession Number CRL 1 573; HeLa, a human 
epithelial carcinoma cell line (ATCC Accession Number CCL-2); A549, a human 
lung carcinoma cell line (ATCC Accession Number CCL 1 889); and other 

25 epithelial-derived cell lines. As a r suit of th adenovirus transformation, the 
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293 cells contain the E1 early region regulatory gene. AH cells were maintained 
in complete DMEM + 10% fetal calf serum unless otherwise noted. 

These cell lines allow the production and propagation of adenovirus-based 
gene delivery vectors that have deletions in preselected gene regions and that 
5 are obtained by cellular complementation of adenoviral genes. To provide the 
desired complementation of such deleted adenoviral genomes in order to 
generate a viral vector, plasmid vectors that contain preselected functional units 
have been designed. Such units include but are not limited to E1 early region, 
E4 and the viral fiber gene. The preparation of plasmids providing such 
10 complementation, thereby being "complementary plasmids or constructs," that 
are stably inserted into host cell chromosomes are described below. 

" ^ h? : ~ ? Preparation of an E4-Expressing Plasmid for Complementation of - 

Hfciri ..bi -:r* E4-Gene-Deleted Adenoviruses : , 

The viral E4 regulatory region contains a single transcription unit that is 

15 alternately spliced to produce several different mRNA products. The 

E4-expressing plasmid prepared as described herein and used to transfect the 
293 cell line contains the entire E4 transcription unit. A DNA fragment 
extending from 175 nucleotides upstream of the E4 transcription start site 
including the natural E4 promoter to 1 53 nucleotides downstream of the E4 

20 polyadenylation signal including the natural E4 terminator signal, corresponding 
to nucleotides 32667-35780 of the adenovirus type 5 (hereinafter referred to as 
Ad5) genome as described in Chroboczek eta/. [Virol., 186:280-285 (1992), 
GenBank Accession Number M73260), was amplified from Ad5 genomic DNA, 
obtained from the ATCC, via the polymerase chain reaction (PGR). Sequences of 

25 the primers used were 5 r CGGTACACA GAATTCA GGAGACACAACTCC3' 
(forward or 5' primer referred to as E4L) (SEQ ID NO: 1) and 
5'GCCTGGATCCGGGAAGTTACGTAACGTGGGAAAAC3' (SEQ ID NO: 2) 
(backward or 3' primer referred to as E4R). To facilitate cloning of the PCR 
fragment, these oligonucleotides were designed to create new sites for the 

30 restriction enzymes EcoR! and BamHI, respectively, as indicated with underlined 
nucleotides. DNA was amplified via PCR using 30 cycles of 92 C for 1 minut , 
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50 C for 1 minute, and 72 C for 3 minutes resulting in amplified full-length E4 
gene products. 

The amplified DNA E4 products were then digested with EcoRI and 
BamHI for cloning into the compatible sites of pBluescript/SK+ by standard 
5 techniques to create the plasmid pBS/E4. A 2603 base pair (bp) cassette 
including the herpes simplex virus thymidine kinase promoter, the hygromycin 
resistance gene, and the thymidine kinase polyadenylation signal was excised 
from the plasmid pMEP4 (Invitrogen, San Diego, CA) by digestion with Fspl 
followed by addition of BamHI linkers (5'CGCGGATCCGCG3') (SEQ ID NO: 3) 

10 for subsequent digestion with BamHI to isolate the hygromycin-containing 
fragment. ■•* 

The isolated BamHI-modified : fragment was then cloned into the BamHI 
site of pBS/E4 containing the E4 region to create the plasmid pE4/Hygro 
containing 8710 bp. The pE4/Hygro plasmid has been deposited with the ATCC 

15 under accession number 97739. The complete nucleotide sequence of 

pE4/Hygro is set forth in SEQ ID NO: 4. Position number 1 of the linearized 
vector corresponds to approximately the middle portion of the pBS/SK + 
backbone. The 5' and 3' ends of the E4 gene are located at respective 
nucleotide positions 3820 and 707 of SEQ ID NO: 4 while the 5' and 3' ends of 

20 the hygromycin insert are located at respective nucleotide positions 3830 and 

6470. In the clone that was selected for use, the E4 and hygromycin resistance 

genes were divergently transcribed. 

B. Preparation of a Fiber-Expressing Plasmid for Complementation of 
Fiber-Gene-Deleted Adenoviruses 

25 To prepare a fiber-encoding construct, primers were designed to amplify 

the fiber coding region from Ad5 genomic DNA with the addition of unique 
BamHI and Notl sites at the 5' and 3' ends of the fragment, respectively. The 
Ad 5 nucleotide sequence is available with the GenBank Accession Number 
M18369. The 5' and 3' primers had the respective nucleotide sequences of 

30 5'ATGGGATCCAAGATGAAGCGCGCAAGACCG3' {SEQ ID NO: 5) and 

5'CATAAC GCGGCCGCT TCTTTATTCTTGGGC3' (SEQ ID NO: 6), wher th 
inserted BamHI and Notl sites are indicated by underlining. The 5' primer also 
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contained a nucleotide substitution 3 nucleotides 5' of the second ATG codon (C 
to A) that is the initiation site. The nucleotide substitution was included so as to 
improve the consensus for initiation of fiber protein translation. 

The amplified DNA fragment was inserted into the BamHI and Notl sites 
5 of pcDNA3 (Invitrogen) to create the plasmid designated pCDNA3/Fiber having 
7148 bp. The parent plasmid contained the CMV promoter, the bovine growth 
hormone (BHG) terminator and the gene for conferring neomycin resistance. The 
viral sequence included in this construct corresponds to nucleotides 
31040-32791 of the Ad5 genome. 

10 The complete nucleotide sequence of pCDNA3/Fiber is listed in SEQ ID 

NO: 7 where the nucleotide position 1 corresponds to approximately the midcJIe 
% it of the pcDNA3 vector sequence. The 5' and 3' ends of the fiber genVSren.' 
r located at respective nucleotide positions 916 with ATG and 2661 wrtH TAA. 

To enhance expression of fiber protein by the constitutive CMV promoter 

15 provided by the pcDNA vector, a Bglll fragment containing the tripartite leader 
(TPL) of adenovirus type 5 was excised from pRD1 12a (Sheay eta/., 
BioTechniques, 15:856-862 (1993) and inserted into the BamHI site of 
pCDNA3/Fiber to create the plasmid pCLF having 7469 bp. The adenovirus 
tripartite leader sequence, present at the 5' end of all major late adenoviral 

20 mRNAs as described by Logan et aL, Proc.JSIatl. Acad. ScL, USA, 81 :3655- 

3659 (1984) and Berkner, BioTechniques, 5:616-629 (1988), also referred to as 
a "partial TPL" since it contains a partial exon 1 , shows correspondence with the 
Ad5 leader sequence having three spatially separated exons corresponding to 
nucleotide positions 6081-6089 (the 3' end of the first leader segment), 7111- 

25 7182 (the entire second leader segment), and 9644-9845 (the third leader 

segment and sequence downstream of that segment). The corresponding cDNA 
sequence of the partial tripartite leader sequence present in pCLF is included in 
SEQ ID NO: 8 bordered by BamHI/Bglll 5' and 3' sites at respective nucleotide 
positions 907-912 to 1228-1233. The nucleotide sequence of an isolated partial 

30 TPL is also listed separately as SEQ ID No. 22 with the noted 5' and 3' 

restriction sites and with the following nucleotide regions identified: 1-6 nt Bglll 
site; 1-18 nt polylinker; 19-27 nt last 9 nt of the first leader segment (exon 1); 
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28-99 nt second leader segment (exon 2); 100-187 nt third leader segment 
(exon 3); 188-301 nt contains the nt sequence immediately following the third 
leader in the genome with an unknown function; and 322-327 nt Bglll site. 
The pCLF plasmid has been deposited with the ATCC as described in 
5 Example 4. The complete nucleotide sequence of pCLF is listed in SEQ ID NO: 8 
where the nucleotide position 1 corresponds to approximately the middle of the 
pcDNA3 parent vector sequence. The 5' and 3' ends of the Ad5 fiber gene are 
located at respective nucleotide positions 1237-1239 with ATG and 2980-2982 
with TAA. 

10 C. Generation of an Adenovirus Packaging Cell Line Carrying Plasmids 

Encoding Functional E4 and Fiber Proteins 

The 293 cell line was selected for preparing the first adenovirus 

packaging line as it already contains the E1 gene as prepared by Graham eta/., 

J. Gen. ViroL, 35:59-74 (1977) and as further characterized by Spector, ViroL, 

15 730:533-538 (1983). Before electroporation, 293 cells were grown in RPMI 
medium + 10% fetal calf serum. Four x 106 cells were electroporated with 20 
//g each of pE4/Hygro DNA and pCLF DNA using a BioRad GenePulser and 
settings of 300 V, 25 jjF. DNA for electroporation was prepared using the 
Qiagen system according to the manufacturer's instructions (Bio-Rad, Richmond, 

20 CA). 

Following electroporation, cells were split into fresh complete DMEM + 
10% fetal calf serum containing 200pg/ml Hygromycin B (Sigma, St. Louis, 
MO). 

From expanded colonies, genomic DNA was isolated using the 
25 "MICROTURBOGEN" system (Invitrogen) according to manufacturer's 

instructions. The presence of integrated E4 DNA was assessed by PCR using 
the primer pair E4R and ORF6L (5'TGCTTAAGCGGCCGCGAAGGAGA AGTCC3') 
(SEQ ID NO: 9), the latter of which is a 5' forward primer near adenovirus 5 
open reading frame 6. 
30 One clone, designated 21 1 , was selected exhibiting altered growth 

properties relative to that seen in parent cell line 293. The 211 clone contained 
the product,indicating the presence of inserted DNA corresponding to most, if 
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not all, of the E4 fragment contained in the pE4/Hygro plasmid. The 211 cell 
line has been deposited with the ATCC as described in Example 4, This line was 
further evaluated by amplification using the primer pair E4L/E4R described 
above, and a product corresponding to the full-length E4 insert was detected. 
5 Genomic Southern blotting was performed on DNA restricted with EcoRI and 
BamHI. The E4 fragment was then detected at approximately one copy/genome 
compared to standards with the EcoRI/BamHI E4 fragment as cloned into pBS/E4 
for use as a labeled probe with the Genius system according to manufacturer's 
instructions (Boehringer Mannheim, Indianapolis, IN)- In DNA from the 211 cell 
10 line, the labeled internal fragment pE4/Hygro hybridized with the isolated E4 
sequences. In addition, the probe hybridized to a larger fragment which may be 
the result of a second insertion event. v k * < 

• - - * 

Although the 211 cell line was not selected by neomycin resistance, thus 
indicating the absence of fiber gene, to confirm the lack of fiber gene, the 211 

15 cell line was analyzed for expression of fiber protein by indirect 

immunofluorescence with an anti-fiber polyclonal antibody and a FITC-labeled 
anti-rabbit JgG (KPL) as secondary. No immunoreactivity was detected. 
Therefore, to generate 21 1 clones containing recombinant fiber genes, the 211 
clone was expanded by growing in RPMI medium and subjected to additional 

20 electroporation with the fiber-encoding pCLF plasmid as described above. 

Following electroporation, cells were plated in DMEM + 10% fetal calf 
serum and colonies were selected with 200//g/ml G418 (Gibco, Gaithersburg, 
MD). Positive cell lines remained hygromycin resistant. These candidate 
sublines of 21 1 were then screened for fiber protein expression by indirect 

25 immunofluorescence as described above. The three sublines screened, 211 A, 
21 1B and 21 1R, along with a number of other sublines, all exhibited nuclear 
staining qualitatively comparable to the positive control of 293 cells infected 
with AdRSVjffgal (1 pfu/cell) and stained 24 hours post-infection. 

Lines positive for nuclear staining in this assay were then subjected to 

30 Western blot analysis under denaturing conditions using the same antibody. 

Several lines in which the antibody detected a protein of the predicted molecular 
weight (62 kd for the Ad5 fiber protein) were selected for further study including 
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21 1 A, 21 1 B and 21 1 R. The 21 1 A cell line has been deposited with ATCC as 
described in Example 4. 

Immunoprecipitation analysis using soluble nuclear extracts from these 
three cell lines and a seminatiye electrophoresis system demonstrated that the 
5 fiber protein expressed is in the functional trimeric form characteristic of the 
native fiber protein. The predicted molecular weight of a trimerized fiber is 186 
kd. Under denaturing conditions, the trimeric form was destroyed resulting in 
detectable fiber monomers. Those clones containing endogenous E1, newly 
expressed recombinant E4 and fiber proteins were selected for use in 
10 complementing adenovirus gene delivery vectors having the corresponding 
adenoviral genes deleted as described in Example 2. 

D. > Preparation of an E1 -Expressing Plasmid for Complementation of 

. ? ..; r ,v t i-;'E1--Gene-Deleteci Adenoviruses 

In order to prepare adenoviral packaging cell lines other than those based 

15 on the E1-gene containing 293 cell line as described in Example 1C above, 

plasmid vectors containing E1 alone or in various combinations with E4 and fiber 
genes are constructed as described below. 

The region of the adenovirus genome containing the E1a and E1b gene is 
amplified from viral genomic DNA by PCR as previously described. The primers 

20 used are E1L, the 5' or forward primer, and E1R, the 3' or backward primer, 
having the respective nucleotide sequences 5'CCG A GCTAGC 
GACTGAAAATGAG3' (SEQ ID NO: 10) and 5'CC TCTCGAG AGACAGC 
AAGACAC3' (SEQ ID NO: 11). The E1L and E1R primers include the respective 
restriction sites Nhel and Xhol as indicated by the underlines. The sites are used 

25 to clone the amplified E1 gene fragment into the Nhel/Xhol sites in pMAM 
commercially available from Clontech (Palo Alto, CA) to form the plasmid 
pDEX/E1 having 1 1 1 52 bp. 

The complete nucleotide sequence of pDEX/E1 is listed in SEQ ID NO: 12 
where the nucleotide position 1 corresponds to approximately 1454 nucleotides 

30 from the 3' end of the pMAM backbone vector sequence. The pDEX/E1 plasmid 
includes nucleotides 552 to 4090 of th aden virus genome positioned 
downstream (beginning at nucleotide position 1460 and ending at 4998 in the 
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pDEX/E1 plasmid) of the glucocorticoid-inducible mouse mammary tumor virus 

(MMTV) promoter of pMAM. The pMAM vector contains the E. coli gpt gene 

that allows stable transfectants to be isolated using hypoxanthine/amino- 

pterin/thymidine (HAT) selection. The pMAM backbone occupies nucleotide 

5 positions 1-1454 and 5005-1 1 152 of SEQ ID NO: 12. 

E. Generation of an Adenovirus Packaging Cell Line Carrying Plasmids 
Encoding Functional E1, and Fiber Proteins 

To create separate adenovirus packaging cell lines equivalent to that of 

the 211 sublines, 211 A, 21 1B and 21 1R, as described in Example 1C, 

10 alternative cell lines lacking adenoviral genomes are selected for transfection 

with the plasmid constructs as described below. Acceptable host cells include 

A549, Hela, Vero and the like cell lines as described in Example 1. The selected 

cell line is transfected with the separateVplas^ 

respectively for expressing E1, and fiber complementary proteins. Following 
1 5 transfection procedures as previously described, clones containing stable 

insertions of the two plasmids are isolated by selection with neomycin and HAT. 

Integration of full-length copy of the E1 gene is assessed by PCR amplification 

from genomic DNA using the primer set E1L/E1R , as described above. 

Functional insertion of the fiber gene is assayed by staining with the anti-fiber 
20 antibody as previously described. 

The resultant stably integrated cell line is then used as a packaging cell 

system to complement adenoviral gene delivery vectors having the 

corresponding adenoviral gene deletions as described in Example 2. 

F. Preparation of a Plasmid Containing Two or More Adenoviral Genes for 
25 Complementing Gene-Deleted Adenoviruses 

The methods described in the preceding Examples rely on the use of two 

plasmids, pE4/Hygro and pCLF, or, pCLF and pDEX/E1 for generating adenoviral 

cell packaging systems. In alternative embodiments, complementing plasmids 

containing two or more adenoviral genes for expressing of encoded proteins in 

30 various combinations are also prepared as described below. The resultant 

plasmids are then used in various cell systems with delivery plasmids having the 

corresponding adenoviral gene deletions. The selection of packaging cell, 

content of the delivery plasmids and content of the complementing plasmids for 
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use in generating recombinant adenovirus viral vectors thus depends on whether 
other adenoviral genes are deleted along with the adenoviral fiber gene, and, if 
so, which ones. 

1 . Preparation of a Complementing Plasmid Containing Fiber 
5 and E1 Adenoviral Genes 

A DNA fragment containing sequences for the CMV promoter, adenovirus 

tripartite leader, fiber gene and bovine growth hormone terminator is amplified 

from pCLF prepared in Example 1B using the forward primer 

5'GACGGATCGGGAGATCTCC3' (SEQ ID NO: 13), that anneals to the 

10 nucleotides 1-19 of the pCDNA3 vector backbone in pCLF, and the backward 

primer 5'eCGCCTCAGAAGCCATAGAGCC3' {SEQ ID NO: 14) that anneals to , 

t ... .nucleotides 1 278-1 257 of the pCDNA3 vector backbone. The fragment /is-, .3 : U 

^amjplified as previously described and then cloned into the pDEX/E1 pla$midf^# a 

v prepared in Example 1D. For cloning in the DNA fragment, the pDEX/E1 vector* 

15 is first digested with Ndel, that cuts at a unique site in the pMAM vector 

backbone in pDEX/E1, then the ends are repaired by treatment with 

bacteriophage T4 polymerase and dNTPs. 

The resulting plasmid containing El and fiber genes, designated 

pE1 /Fiber, provides dexamethasone-inducible E1 function as described for 

20 DEX/E1 and expression of Ad 5 fiber protein as described above. 

The complete nucleotide sequence of pE1 /Fiber is listed in SEQ ID NO: 15 

where the nucleotide position 1 corresponds to approximately 1459 nucleotides 

from the 3' end of the parent vector pMAM sequence. The 5' and 3' ends of 

the Ad5 E1 gene are located at respective nucleotide positions 1460 and 4998 

25 followed by pMAM backbone and then separated from the Ad5 fiber from pCLF 

by the filled-in blunt ended Ndel site. The 5' and 3' ends of the pCLF fiber gene 

fragment are located at respective nucleotide positions 10922-14223 containing 

elements as previously described for pCLF. 

The resultant pE1 /Fiber plasmid is then used to complement one or more 

30 delivery plasmids expressing E1 and fiber. 

The pE1 /Fiber construct is then us d to transfect a selected host cell as 

described in Example 1 E to generat stable chromosomal insertions preformed as 
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previously described followed by selection on HAT medium. The stable cells are 

then used as packaging cells as described in Example 2. 

2. Preparation of a Complementing Plasmid Containing E4 and 
Fiber Adenoviral Genes 

5 Plasmid pCLF prepared as described in Example 1 B is partially digested 

with Bglll to cut only at the site in the pCDNA3 backbone. The pE4/Hygro 

plasmid prepared in Example 1 A is digested with BamHI to produce a fragment 

containing E4. The E4 fragment is then inserted into the BamHI site of pCLF to 

form plasmid pE4/Fiber. The resultant plasmid provides expression of the fiber 

10 gene as described for pCLF and E4 function as described for pE4/Hygro. 

A schematic plasmid map of pE4/Fiber, having 10610 bp. The complete 
nucleotide sequence of pE^/FiberJsJstedJn SEQ ID NO: 16 where the 
nucleotide position 1,corresjDpp^;to approximately 14 bp from the 3' end of the 
parent vector pCDNA3 backbone sequence. The 5', and 3' ends of the Ad5 E4 

15 gene are located at respective nucleotide positions 21 and 3149 followed by 
fused Bglll/BamHI sites and pCDNA3 backbone including the CMV promoter 
again followed by Bglll/BamHI sites. The adenovirus leader sequence begins at 
nucleotide position 4051 and extends to 4366 followed by fused BamHI/Bglll 
sites and the 5' and 3' ends of the fiber gene located at respective nucleotide 

20 positions 4372 and 6124. 

Stable chromosonal insertions of pE4/Fiber in host cells are obtained as 
described above. 

EXAMPLE 2 

Preparation of Adenoviral Gene Delivery Vectors Using Adenoviral Packaging Cell 
25 Lines 

Adenoviral delivery vectors are prepared to separately lack the 
combinations of E1 /fiber and E4/fiber. Such vectors are more replication- 
defective than those previously in use due to the absence of multiple viral genes. 
A preferred adenoviral delivery vector is replication competent but only via a 
30 non-fiber means is one that only lacks the fiber gene but contains the remaining 
functional adenoviral regulatory and structural genes. Furthermore, thes 
adenovirus delivery vectors have a higher capacity for insertion of foreign DNA. 
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A. Preparation of Adenoviral Gene Delivery Vectors Having Specific Gene 
Deletions and Methods of Use 

To construct the E1/ fiber deleted viral vector containing the LacZ 
reporter gene construct, two new plasmids were constructed. The plasmid pA 
5 E1 Bjffgal was constructed as follows. A DNA fragment containing the SV40 
regulatory sequences and E. colt jff-galactosidase gene was isolated from pSV/7gal 
(Promega) by digesting with Vspl, filling the overhanging ends by treatment with 
Klenow fragment of DNA polymerase I in the presence of dNTP's and digesting 
with Bam H1. The resulting fragment was cloned into the EcoRV and BamHI 
10 sites in the polylinker of pA E1sp1B {Microbix Biosystems, Hamilton, Ontario) to 
^form pA E1B /?gal that therefore contained the left end of the adenovirus-genome 
- with the Ela region replaced by the LacZ cassette (nucleotides 6690 to 4151) of 
pSV#gal. Plasmid DNA may be prepared by the alkaline lys!S^met^^pd , as i 
described by Birnboim and Doly, Nuc. Acids Res., 7:1513-1*523 (1978) or by?the 
15 Quiagen method according to the manufacturer's instruction, from transformed 
cells used to expand the plasmid DNA was then purified by CsCI-ethidium 
bromide density gradient centrifugation. Alternatively, plasmid DNAs may be 
purified from E coli by standard methods known in the art (e.g. see Sambrook et 
al.) 

20 The second plasmid (pDV44), prepared as described herein, is derived 

from pBHGIO, a vector prepared as described by Bett et al., Proc. Natl. Acad. 
ScL, USA, 91:8802-8806 (1994) (see, also International PCT application No. 
WO 95/00655) using methods well known to one of skill in the art. This vector 
is also commercially available from Microbix and and contains an Ad 5 genome 

25 with the packaging signals at the left end deleted and the E3 region (nucleotides 
28133:30818) replaced by a linker with a unique site for the restriction enzyme 
PacL An 1 1 .9 kb BamHI fragment, which contains the right end of the 
adenovirus genome, is isolated from pBHGIO and cloned into the BamHI site of 
pBS/SK( + ) to create plasmid p1 1.3 having approximately 14,658 bp. The p1 1.3 

30 plasmid was then digested with Pad and Sail to remove the fiber, E4, and 
inverted terminal repeat (ITR) sequences. 
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This fragment was replaced with a 3,4 kb fragment containing the ITR 
segments and the E4 gene which was generated by PCR amplification from 
pBHGIO using the following oligonucleotide sequences: 
5' TQTACACCG GATCCGGCGCACACC3' SEQ ID NO: 17; and 
5 5'CACAACGAGCTC AATTAATTAATTGCCACATCCTC3 f SEQ ID NO: 18. 

These primers incorporated sites for Pad and BamHI. Cloning this fragment into 
the Pact and blunt ended Sail sites of the p1 1 .3 backbone resulted in a 
substitution of the fused ITRs, E4 region and fiber gene present in pBHGIO, by 
the ITRs and E4 region alone. The resulting p1 1 .3 plasmid containing the ITR 

10 and E4 regions, designated plasmid pDV43a, was then digested with BamHI. 
This BamHI fragment was therffused to replace a BamHI fragment in pBHGIO 
thereby creating pDV44 In a pBHGIO backbone. 

In an al^^atr/e''aj)}>rbach to preparing pDV44 with an additional 
subcloning step to facilitate the incorporation of restriction cloning sites, the 

15 following cloning procedure was performed. pDV44 as above was constructed 
by removing the fiber gene and some of the residual E3 sequences from pBHGIO 
(Microbix Biosystems). As above, to simplify manipulations, the 1 1.9 kb BamHI 
fragment including the rightmost part of the Ad5 genome was removed from 
pBHGIO and inserted into pBS/SK. The resulting plasmid was termed p1 1 .3. 

20 The 3.4 kb DNA fragment corresponding to the E4 region and both ITRs of 
adenovirus type 5 was amplified as described above from pBHGIO using the 
oligonucleotides listed above and subcloned into the vector pCR2.1 (Invitrogen) 
to create pDV42. This step is the additional cloning step to facilitate the 
incorporation of a Sail restriction site. pDV42 was then digested with Pad, 

25 which cuts at a unique site (bold type) in one of the PCR primers, and with Sail, 
which cuts at a unique site in the pCR2.1 polylinker. This fragment was used to 
replace the corresponding Pacl/Xhol fragment of pi 1.3 (the pBS polylinker 
adjacent to the Ad DNA fragment contains a unique Xhol site), creating pDV43. 

30 A plasmid designated pDV44 was constructed by replacing the 1 1 .9 kb 

BamHI fragment of pBHGIO by the analogous BamHI fragment of pDV43. As 
generated in th first procedure, pDV44 therefore differs from pBHGIO by the 
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deletion of Ad 5 nucleotides 30819:32743 (residual E3 sequences and all but the 
3'-most 41 nucleotides of the fiber open reading frame). 

Thus, to summarize, the cloning procedures described above result in the 
production of a fiber-deleted Ad 5 genomic plasmid (pDV44) that was 
5 constructed by removing the fiber gene and some of the residual E3 sequences 
from pBHGIO. pDV44 contains a wild-type E4 region, but only the last 41 
nucleotides of the fiber ORF (this sequence was retained to avoid affecting 
expression of the adjacent E4 transcription unit). Plasmids pBHGIO and pDV44 
contain unpackageable Ad5 genomes, and must be rescued by cotransfection 

10 and subsequent homologous recombination with DNA carrying functional 

packaging signals. In order to generate vectors marked with a 'reporter gene, 
either pDV44 or pBHGIO was cptransfected with 'pAEIBBgal, which contains * 
the left end of the Ad5 genome with an SV40-dfiVen ; B-galactosidase reporter 
gene inserted in place of the E1 region. 

15 In general, and as described below, the method for virus production by 

recombination of plasmids followed by complementation in cell culture involves 
the isolation of recombinant viruses by cotransfection of any one of the 
adenovirus packaging cell systems prepared in Example 1 , namely 21 1 A, 21 1B, 
21 1R, A549, Vero cells, and the like, with plasmids carrying sequences 

20 corresponding to viral gene delivery vectors. 

A selected cell line is plated in dishes and cotransfected with pDV44 and 
pAE1 Bft gal using the calcium phosphate method as described by Bett et al., 
Proc. Natl. Acad. ScL, USA, 37:8802-8806 (1994). Recombination between the 
overlapping adenovirus sequences in the two plasmids leads to the creation of a 

25 full-length viral chromosome where pDV44 and pAE1B/? gal recombine to form a 
recombinant adenovirus vector having multiple deletions. The deletion of E1 and 
of the fiber gene from the viraF chromosome is compensated for by the 
sequences integrated into the packaging cell genome, and infectious virus 
particles are produced. The plaques thus generated are isolated and stocks of 

30 the recombinant virus are produced by standard methods. 

Because of the fiber deletion, a pDV44-derived virus is 
replication-defective, cells in which it is grown must complement this defect. 
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The 21 1B cell line {a derivative of 293 cells which expresses the wild-type (wt) 
AD5 fiber and is equivalent to 21 1 A on deposit with ATCC as described in 
Example 4) was used for rescue and propagation of the virus described here. 
pDV44 and pAEIBgal were cotransfected into 21 1B cells, and the monolayers 
5 were observed for evidence of cytopathic effect (CPE). Briefly, for virus 

construction, cells were transfected with the indicated plasmids using the Gibco 
Calcium Phosphate Transfection system according to the manufacturer's 
instructions and observed daily for evidence of CPE. 

One of a total of 58 transfected dishes showed evidence of spreading cell 

10 death at day 15. A crude freeze-thaw lysate was prepared from these cells and 
the resulting virus (termed Ad5.fSgal.AF) was plaque purified twice and then 
expanded. ^To^prepafe purified viral preparations, cells were infected with the * 
1 indic^^^^aind observed for completion of CPE. Briefly, at day zero, 21 1B 
cells were plated in DMEM plus 10% fetal calf serum at approximately 1 X 10 7 

15 cells/150 cm 2 flask or equivalent density. At day one, the medium was replaced 
with one half the original volume of fresh DMEM containing the indicated Ad, in 
this case Ad5.dgal.AF, at approximately 100 particles/cell. At day two, an equal 
volume of medium was added to each flask and the cells were observed for CPE. 
Two to five days after infection, cells were collected and virus isolated by lysis 

20 via four rapid freeze-thaw cycles. Virus was then purified by centrifugation on 
preformed 15-40% CsCI gradients (1 1 1,000 x g for three hours at 4°C). The 
bands were harvested, dialyzed into storage buffer (10 mM Tris-pH 8.1, 0.9% 
NaCI, and 10% glycerol), aliquoted and stored at - 70°C. Purified Ad5.Bgal.AF 
virus particles containing human adenovirus Ad5.Sgal.AFgenome (described 

25 further below) have been deposited with the ATCC on January 1 5, 1 999 as 
further described in Example 4. 

For viral titertng, as necessary in the below Examples, Ad preparations 
were titered by plaque assay on 21 1B cells. Cells were plated on 
polylysine-coated 6 well plates at 1.5 x 10 8 cells/well. Duplicate dilutions of 

30 virus stock wer added to the plates in 1 ml/well of complete DMEM. After a 
five hour incubation at 37 °C, virus was removed and the wells overlaid with 2 
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ml of 0.6% low-melting agarose in Medium 199 (Gibco). An additional 1 ml of 
overlay was added at five day intervals. 

As a control, the first-generation virus Ad5.S gal.wt, which is identical to 
Ad5.Bgal.AF except for the fiber deletion, was constructed by cotransfection of 
5 pBHGIO. and pAEIBBgal. In contrast to the low efficiency of recovery of the 
fiberless genome (1/58 dishes), all of 9 dishes cotransfected with pAE1B£gal 
and pBHGIO produced virus. 

In another embodiment, a delivery plasmid is prepared that does not 
require the above-described recombination events to prepare a viral vector 

10 having a fiber gene deletion. In one embodiment, a single delivery plasmid 
containing all the adenoviral genome necessary for packaging but lacking the 
fiber gene is prepared from plasmid pFGX^O^cpntainihg full-length Ad5 that is 
commercially available from Microbi^i^hle^re delivery plasmid referred to 

as pFG140-f is then used with pCLF stably integrated cells as described above to 

1 5 prepare a viral vector lacking fiber. For genetic therapy, the fiber gene can be 
replaced with a therapeutic gene of interest for preparing a therapeutic delivery 
adenoviral vector. Methods for producing a fiberless vector with a complete TPL 
are described in Example 3. 

Vectors for the delivery of any desired gene and preferably a therapeutic 

20 gene are prepared by cloning the gene of interest into the multiple cloning sites 
in the polylinker of commercially available pAE1sp1B (Microbix Biosystems), in 
an analogous manner as performed for preparing pE1B£ gal as described above. 
The same cotransfection and recombination procedure is then followed as 
described herein to obtain viral gene delivery vectors as further discussed in later 

25 Examples. 

1 . Characterization of the Ad5./?gaI.AF Genome 
To confirm that the vector genomes had the proper structures and that 
the fiber gene was absent from the Ad5.Bgal.AF chromosome, the DNA isolated 
from viral particles was analyzed. Briefly, purified viral DNA was obtained by 
30 adding 10 /il of 10 mg/ml proteinase K, 40 fA of 0.5 M EDTA and 50 //I of 10% 
SDS to 800 //I f ad novirus-containing culture supernatant. The suspension 
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was then incubated at 55 °C for 60 minutes. The solution was then extracted 
once with 

400 //I of a 24:1 mixture of chloroform:isoamyl alchohol. The aqueous phase 

was then removed and precipitated with sodium acetate/ethanol. The pellet was 

5 washed once with 70% ethanol and lightly dried. The pellet was then 

suspended in 40 //I of 10 mM Tris-HCI, pH 8.0, 1 mM EDTA. Genomic DNA 

from Ad5.Bgal.wt and Ad5.ftgal.AF produced the expected restriction patterns 

following digestion with either EcoRI or with Ndel. Southern blotting, performed 

with standard methods, with labeled fiber DNA as a probe demonstrated the 

10 presence of fiber sequence in Ad5.Bgal.wt but not in Ad5.Bgal.AF DNA. As a 

positive: control, the blot was stripped and reprobed with labeled E4 sequence. ^ 

: Fiber arid E4 sequences were detected by using labeled inserts from -pCLF ahd^ 

/i^pE4/Hygrc^ respectively. E4 signal was readily detectable in both^en"6#iel^t 

equal intensities! The complete nucleotide sequence of Ad5.ftgal.AF is presented 

15 in SEQ ID NO: 23 and is contained in the virus particle on deposit with ATCC. 

2. Characterization of the Fiberless Adenovirus Ad5.jffgal.AF 

To verify that Ad5.ftgal.AF was fiber-defective, 293 cells (which are 

permissive for growth of E1 -deleted Ad vectors but do not express fiber) were 

infected with Ad5.Bgal.AF or with Ad5.Bgal.wt. Twenty-four hours post 

20 infection, the cells were stained with polyclonal antibodies directed either against 

fiber or against the penton base protein. Cells infected with either virus were 

stained by the anti-penton base antibody, while only cells infected with the 

Ad5.Bgal.wt control virus reacted with the anti-fiber antibody. This confirms 

that the fiber-deleted Ad mutant does not direct the synthesis of fiber protein. 

25 3. Growth of the Fiber-Deleted Ad5./?gal.AF Vector in 

Complementing Cells 

Ad5.ftgal.AF was found to readily be propagated in 21 1 B cells. As 

assayed by protein concentration, CsCI-purified stocks of either Ad5.Bgal.AF or 

Ad5.ftgal.wt contained similar numbers of viral particles. The particles appeared 

30 to band normally on CsCI gradients. Infectivity of the Ad5.ftgal.AF particles was 

lower than the Ad5.ftgal.wt control, as indicated by an increas d particle/PFU 

ratio. Ad5.ftgal.AF was also found to plaque more slowly than the control 
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virus. When plated on 21 1B cells, Ad5.Bgal.wt plaques appeared within 5-7 
days, while plaques of Ad5.Sgai.AF continued to appear until as much as 15-18 
days post infection. Despite their slower formation, the morphology of 
Ad5.Bgal.AF plaques was essentially normal. 
5 4. Production of Fiberless Ad5.Bgal.AF Particles 

As Ad5.Bgal.AF represents a true fiber null mutation and its stocks are 
free of helper virus, the fiber mutant phenotype was readily investigated. A 
single round of growth in cells (such as 293) which do not produce fiber 
generating a homogeneous preparation of fiberless Ad allowed for the 

10 determination of whether such particles would be stable and/or infectious. 
Either Ad5.Bgal.wt or Ad5.Bgal.AF waslgrown in 293 or 21 1B cells, and the 
resulting particles purified o& OsGkgradients as previously described. 
Ad5.Bgal.AF particles wVireycei^ii^roduced'iii 293 cells at approximately the 
same level as the control virus and behaved similarly on the gradients, indicating 

15 that there was not a gross defect in morphogenesis of fiberless capsids. 

Particles of either virus contained similar amounts of penton base 
regardless of the cell type in which they were grown. This demonstrated that 
fiber is not required for assembly of the penton base complex into virions. The 
Ad5.Bgal.AF particles produced in 293 cells did not contain fiber protein. 

20 21 IB-grown Ad5.Bgal.AF also contained less fiber than the Ad5.Bgal.wt control 
virus. The infectivities of the different viral preparations on epithelial cells 
correlated with the amount of fiber protein present. The fiberless Ad particles 
were several thousand-fold less infectious than the first-generation vector control 
on a per-particle basis, while infectivity of 21 IB-grown Ad5.Bgal.AF was only 

25 50-100 fold less than that of Ad5.Bgal.wt. These studies confirmed fiber's 

crucial role in infection of epithelial cells via CAR binding. 

5. Composition and Structure of the Fiberless Ad5.Bgal.AF 
Particles 

The proteins contained in particles of 293-grown Ad5.BgaLAF were 
30 compared to those in Ad5.Bgal.wt f to determine whether proteolysis or particle 
assembly was defectiv in this fiber null mutant. The verall pattern of proteins 
in the fiberless particles was observed to be quite similar to that of a 



WO 01/83729 



PCT/EP01/04863 



-67- 

first-generation vector, with the exception of reduced intensity of the composite 
band resulting from proteins Ilia and IV (fiber). The fiberless particles also had a 
reduced level of protein VII. Although substantial amounts of uncleaved 
precursors to proteins VI, VII, and VIII were not seen, it is possible that the 
5 low-molecular weight bands migrating ahead of protein VII represent either 
aberrantly cleaved viral proteins or their breakdown products. 

Cryo-electron microscopy was used to more closely examine the structure 
of the 293 grown Ad5.Bgal.AF and of Ad5Bgal.wt. The fiber, having an 
extended stalk with a knob at the end, was faintly visible in favorable 

10 orientations of wild-type Ad5 particles, but not in images of the fiberless 

:( particles. Filamentous material likely corresponding to free viral -DNA^was seen 
in micrographs of fiberless particles. This material was .alsg present- in " 
micrographs of the first-generation control virus, albeit at much 'lower levels'.' , 
*■ Three-dimensional image reconstructions of fiberless and wild-type 

15 particles at —20 A resolution showed similar sizes and overall features, with the 
exception that fiberless particles lacked density corresponding to the fiber 
protein. The densities corresponding to other capsid proteins, including penton 
base and proteins Ilia, VI, and IX, were comparable in the two structures. This 
confirms that absence of fiber does not prevent assembly of these components 

20 into virions. The fiber was truncated in the wild-type structure as only the lower 
portion of its flexible shaft follows icosahedral symmetry. The RGD protrusions 
on the fiberless penton base were angled slightly inward relative to those of the 
wild-type structure. Another difference between the two penton base proteins 
was that there is a ~30 A diameter depression in the fiberless penton base 

25 around the five-fold axis where the fiber would normally sit. The Ad5 

reconstructions confirm that capsid assembly, including addition of penton base 

to the vertices, is able to proceed in the complete absence of fiber. 

6. Integrin-Dependent Infectivtty of Fiberless Ad5.Bgal.AF 
Particles 

30 While attachment via the viral fiber protein is a critical step in the 

infection of epithelial cells, an alternative pathway for infection of certain 
hematopoietic cells has been described. In this case, penton base mediates 
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binding to the cells (via S2 integrins) and internalization (through interaction with 
av integrins). Particles lacking fiber might therefore be expected to be 
competent for infection of these cells, even though on a per-particle basis they 
are several thousand-fold less infectious than normal Ad vectors on epithelial 
5 cells. 

To investigate this, THP-1 monocytic cells were infected with 
Ad5.Bgal.wt or with Ad5.Sgal.AF grown in the absence of fiber. Infection of 
THP-1 cells was assayed by infecting 2 x 10 s cells at the indicated m.o.i. in 0.5 
ml of complete RPMI. Forty-eight hours post-infection, the cells were fixed with 

10 glutaraldehyde and stained with X-gal, and the percentage of stained cells was 

determined by light microscopy. The results of the infection assay showed that * < 

the f iberless patficilesi were only a few-fold less infectious than first-generation 
Ad on THP-1^ceJ|s.v*l:aTge differences were seen in plaquing efficiency on 
epithelial (21 1 B) cells. Infection of THP-1 cells by either Ad5.Bgal.AF or 

15 Ad5.Bgal.wt was not blocked by an excess of soluble recombinant fiber protein, 
but could be inhibited by the addition of recombinant penton base). These 
results indicate that the fiberless Ad particles use a fiber-independent pathway to 
infect these cells. Furthermore, the lack of fiber protein did not prevent 
Ad5.&galAF from internalizing into the cells and delivering its genome to the 

20 nucleus, demonstrating that fiberless particles are properly assembled and are 
capable of uncoating. 

The foregoing results with the recombinant viruses thus produced 
indicates that they can be used as gene delivery tools in cultured cells and in 
vivo as described more fully in the Examples. For example, for studies of the 

25 effectiveness and relative immunogenicity of multiply-deleted vectors, virus 
particles are produced by growth in the packaging lines described in Example 1 
and are purified by CsCI gradient centrifugation. Following titering, virus 
particles are administered to mice via systemic or local injection or by aerosol 
delivery to lung. The LacZ reporter gene allows the number and type of cells 

30 which ar successfully transduced to be evaluated. The duration of transgene 
expression is evaluated in order to determine the long-t rm effectiveness of 
treatment with multiply-deleted recombinant adenoviruses relative to the 
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standard technologies which have been used in clinical trials to date. The 

immune response to the improved vectors described here is determined by 

assessing parameters such as inflammation, production of cytotoxic T 

lymphocytes directed against the vector, and the nature and magnitude of the 

5 antibody response directed against viral proteins. 

Versions of the vectors which contain therapeutic genes such as CFTR 

for treatment of cystic fibrosis or tumor suppressor genes for cancer treatment 

are evaluated in the animal system for safety and efficiency of gene transfer and 

expression. Following this evaluation, they are used as experimental therapeutic 

10 agents in human clinical trials. 

B. Retargeting of Adenoviral Gene Delivery Vectors by Producing 
Viral Particles Containing Different or Altered Fiber Proteins 

As the specificity of adenovirus binding to target cells is largely 

determined by the fiber protein, viral particles that incorporate modified fiber 

1 5 proteins or fiber proteins from different adenoviral serotypes (pseudotyped 
vectors) have different specificities. Thus, the methods of expression of the 
native Ad 5 fiber protein in adenovirus packaging cells as described above is also 
applicable to production of different fiber proteins. 

Chimeric fiber proteins can be produced according to known methods 

20 {see, e.g., Stevenson etal. (1995) J. Virol., 63:2850-2857), Determinants for 
fiber receptor binding activity are located in the head domain of the fiber and an 
isolated head domain is capable of trimerization and binding to cellular receptors. 
The head domains of adenovirus type 3 (Ad3) and Ad5 were exchanged in order 
to produce chimeric fiber proteins. Similar constructs for encoding chimeric fiber 

25 proteins for use in the methods herein are contemplated. Thus, instead of the 
using the intact Ad 5 fiber-encoding construct prepared in above and in U.S. 
application Serial No. 09/482,682) as a complementing viral vector in adenoviral 
packaging cells, the constructs described herein are used to transfect cells along 
with E4 and/or E1 -encoding constructs. 

30 Briefly, full-length Ad5 and Ad3 fiber genes were amplified from purified 

adenovirus genomic DNA as a template. The Ad5 and Ad3 nucleotide 
sequences are available with the respective GenBank Accession Numbers 
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M18369 and M1241 1 . Oligonucleotide primers are designed to amplify the 
entire coding sequence of the full-length fiber genes, starting from the start 
codon, ATG, and ending with the termination codon TAA. For cloning purposes, 
the 5' and 3' primers contain the respective restriction sites BamHI and Notl for 
5 cloning into pcDNA plasmid as described in Example 1 A. PCR is performed as 
described above. 

The resulting products are then used to construct chimeric fiber 
constructs by PCR gene overlap extension (Horton et al. (1990) BioTechniques, 
5:525-535), The Ad5 fiber tail and shaft regions (5TS; the nucleotide region 

10 encoding amino acid residue positions 1 to 403) are connected to the Ad3 fiber 
head region {3H; t$e nucleotide region encoding amino acid residue positions 
136^^19)^6^6^ the 5TS3H fiber chimera. Conversely, the Ad3 fiber taiPand 
' shijftj r^gi6ns>'(3TS; the nucleotide region encoding amino acid residues positionW 
1 to i'35) are connected to -the Ad5 fiber head region (5H; the nucleotide region 5 

15 encoding the amino acid residue positions 404 to 581) to form the 3TS5H fiber 
chimera. The fusions are made at the conserved TLWT (SEQ ID NO: 19) 
sequence at the fiber shaft-head junction. 

The resultant chimeric fiber PCR products are then digested with BamHI 
and Notl for separate directional ligation into a similarly digested pcDNA 3.1 . 

20 The TPL sequence is then subcloned into the BamHI as described in Example 1 A 
for preparing an expression vector for subsequent transfection into 211 cells as 
described above or into the alternative packaging cell systems as previously 
described. The resultant chimeric fiber construct-containing adenoviral 
packaging cell lines are then used to complement adenoviral delivery vectors as 

25 previously described. Other fiber chimeric constructs are obtained with the 
various adenovirus serotypes using a similar approach. 

In an alternative embodiment, the use of modified proteins including with 
modified epitopes {see, e.g., Michael eta/. (1995) Gene Therapy, 2:660-668 
and International PCT application Publication No. WO 95/2641 2, which describe 

30 the construction of a cell-type specific therapeutic viral vector having a new 
binding specificity incorporated int the virus concurrent with the destruction of 
the endogenous viral binding specificity). In particular, th authors described the 
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production of an adenoviral vector encoding a gastrin releasing peptide (GRP) at 
the 3' end of the coding sequence of the Ad5 fiber gene. The resulting fiber- 
6RP fusion protein was expressed and shown to assemble functional fiber 
trimers that were correctly transported to the nucleus of HeLa cells following 
5 synthesis. 

Similar constructs are contemplated for use in the complementing 
adenoviral packaging cell systems for generating new adenoviral gene delivery 
vectors that are targetable, replication-deficient and less immunogenic. 
Heterologous ligands contemplated for use herein to redirect fiber specificity 

10 range from as few as 10 amino acids in size to large globular structures, some of 
which necessitate the addition of a spacer region so as to reduce or preclude 
steric hindrance of the heterologous ligariti- with the fiber or prevent trimerization 
of the fiber protein. The ligands^elMerted at the'end or within the linker 
region. Preferred ligands include those that target specific cell receptors or 

15 those that are used for coupling to other moieties such as biotin and avidin. 
A preferred spacer includes a short 12 amino acid peptide linker 
composed of a series of serines and alanine flanked by a proline residue at each 
end using routine procedures known to those of skill in the art. The skilled 
artisan will be with the preparation of linkers to accomplish sufficient protein 

20 presentation and to alter the binding specificity of the fiber protein without 
compromising the cellular events that follow viral internalization. Moreover, 
within the context of this disclosure, preparation of modified fibers having 
ligands positioned internally within the fiber protein and at the carboxy terminus 
as described below are contemplated for use with the methods described herein. 

25 

The preparation of a fiber having a heterologous binding ligand is prepared 
essentially as described in the above-cited paper. Briefly, for the ligand of 
choice, site-directed mutagenesis is used to insert the coding sequence for a 
linker into the 3' end of the Ad5 fiber construct in pCLF as prepared in Example 
30 1, 

The 3' or anttsense or mutagenic oligonucleotid encodes a preferred 
linker sequenc of ProSerAlaSerAlaSerAlaSerAlaProGlySer (SEQ ID NO: 20) 
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followed by a unique restriction site and two stop codons, respectively, to allow 
the insertion of a coding sequence for a selected heterologous ligand and to 
ensure proper translation termination. Flanking this linker sequence, the 
mutagenicoligonucloetide contains sequences that overlap with the vector 
5 sequence and allow its incorporation into the construct. Following mutagenesis 
of the pCLF sequence adding the linker and stop codon sequences, a nucleotide 
sequence encoding a preselected ligand is obtained, linkers corresponding to the 
unique restriction site in the modified construct are attached and then the 
sequence is cloned into linearized corresponding restriction site. The 
10 resultant fiber-ligand construct is then used to transfect 21 1 or the alternative 
, cell. packaging systems previously described to produce complementing viral ^ 
i.. f vit vector packaging systems. . Vi 1 Ui'* i<?,F 

,n a further embodiment, intact fiber genes from different^dV^iiqtypes 
rT are expressed by 21 1 cells or an alternative packaging system as previously 
15 described. A gene encoding the fiber protein of interest is first cloned to create 
a plasmid analogous to pCLF, and stable cell lines producing the fiber protein are 
generated as described above for Ad5 fiber. The adenovirus vector described 
which lacks the fiber gene is then propagated in the cell line producing the fiber 
protein relevant for the purpose at hand. As the only fiber gene present is the 
20 one in the packaging cells, the adenoviruses produced contain only the fiber 
protein of interest and therefore have the binding specificity conferred by the 
complementing protein. Such viral particles are used in studies such as those 
described above to determine their properties in experimental animal systems. 

EXAMPLE 3 

25 Tripartite leader sequences (TPLs) that are useful in enhancing the 

expression of complementing adenoviral proteins, particularly fiber protein, for 
use in preparing an adenoviral gene delivery vector are provided. The complete 
Ad5 TPL was constructed by assembling PCR fragments. First, the third TPL 
exon (exon 3) {nt 9644-9731 of the Ad5 genome) was amplified from Ad5 

30 genomic DNA using the synthetic oligonucleotid primers 

5'CTCAACAATTGTGGATCCGTACTCC3'(SEQ ID No. 24) and 
5'GTGCTCAGCAGATCTTGCGACTGTG3' (SEQ ID No. 25). The resulting 
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product was cloned to the BamHI and Bglll sites of pAE1Sp1a (Microbix 
Biosystems) using sites in the primers (shown in bold) to create plasmid pDV52. 
A fragment corresponding to the first TPL exon {exon 1 ), the natural first intron 
(intron 1), and the second TPL exon {exon 2) (Ad5 nt 6049-7182) was then 
5 amplified using primers 5'GGCGCGTTCGGATCCACTCTCTTCC3' (SEQ ID No, 
26) and 5XTACATGCTAGGCAGATCTCGTTCGGAG3' (SEQ ID No. 27),and 
cloned into the BamHI site of pDV52 (again using sites in the primers) to create 
pDV55. 

This plasmid contains a 1 .2 kb BamHI/Bglll fragment containing the first 
10 TPL exon, the natural first intron, and the fused second and third TPL exons. 
The nucleotide sequence of the complete TPL containing the noted 5' and 3' 
restriction sites is showmiri SEQ ID "No 28 with the following nucleotide regions 
identified: 1-6 fit Bamffi 'site;' 7-47 nt first leader segment (exon 1); 48-1068 nt 
natural first intron (intron 1); 1069-1 140 nt second leader segment (exon 2); 
15 1 141-1 146 nt fused BamHI and Bglll sites; 1 147-1 234 nt third leader segment 
(exon 3); and 1235-1240 nt Bglll site, 

EXAMPLE 4 

Deposit of Materials 

The following cell lines and plasmids were deposited on September 25, 

20 1996, with the American Type Culture Collection, 10801 University Blvd, 

Manassas, Virginia, USA (ATCC) under the provisions of the Budapest Treaty on 
the International Recognition of the Deposit of Microorganisms for the Purpose 
of Patent Procedure and the Regulations thereunder (Budapest Treaty):Plasmid 
pE4/Hygro (accession number 97739), Plasmid pCLF (accession number 97737), 

25 21 1 Cell Line (accession number CRL-12193) and 21 1 A Cell Line (accession 
number CRL-1 21 94) 

The following virus, Ad5.jSgal.AF, was deposited on January 15, 1999, 
with the ATCC as listed above and provided with accession number VR2636. 
Additionally, plasmids pDV60, pDV67, pDV69, pDV80 and pDV90 were 

30 deposited at the ATCC on January 5, 2000 and provided with accession 
numbers PTA-1 144, PTA-1 145, PTA-1 146, PTA-1 147 and PTA-1 148 
respectively. 
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EXAMPLE 5 

Preparation and Use of Adenoviral Packaging Cell Lines 
Containing Plasmids Containing Alternative TPLs 

Plasmids containing tripartite leaders (TPLs) have been constructed. The 

5 resulting plasmids that contain different selectable markers, such as neomycin 

and zeocin, were then used to prepare fiber-complementing stable cell lines for 

use as for preparing adenoviral vectors. 

A. pDV60 

Plasmid pDV60 was constructed by inserting this TPL cassette of SEQ ID 
10 No. 28 into the BamHI site upstream of the Ad5 fiber gene in pcDNA3/Fiber, a 
neomycin selectable plasmid {see, e.g., U.S. application Serial No. $9/482, 682 
(also filed as International PCT application No. PCT/US00/pp265 on January 14, 
- 2000); see also Von Seggern et al. ( ^^>r 

(1998) J. Gen Virol, 79: 1461-1468). The nucleotide sequence of pDV60 is 
15 listed in SEQ ID NO: 29. Plasmid pDV60 has been deposited in the ATCC under 
accession number PTA-1 144. 

B. pDV61 

To construct pDV61, an Asp718/Not! fragment containing the CMV 
promoter, partial Ad5 TPL, wildtype Ad5 fiber gene, and bovine growth hormone 

20 terminator was transferred from pCLF (ATCC accession number 97737; and 
described in copending U.S. application Serial No. 09/482,682 (also filed as 
International PCT application No. PCT/USOO/00265 on January 14, 2000);), to a 
zeocin selectable cloning vector referred to as pCDNA3.1/Zeo ( + ) (commericaliy 
available from Invitrogen and for which the sequence is known). 

25 C. pDV67 

In an analogous process, pDV67 containing complete TPL was 
constructed by transferring an Asp 718/Xbal fragment from pDV60 into 
pcDNA3.1/Zeo{ + ) backbone. The nucleotide sequence of pDV67 is set forth in 
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SEQ ID No. 30. Plasmid pDV67 is available from the ATCC under accession 
number PTA-1 145. 

D. pDV69 

To prepare pDV69 containing a modified fiber protein, the chimeric 
5 Ad3/Ad5 fiber gene was amplified from pGEM5TS3H (Stevenson etal (1995) J. 
ViroL, 69: 2850-2857) using the primers 5'ATGGGAT 
CAAGATGAAGCGCGCAAGACCG3' (SEQ ID No. 31) and 

5'CACTATAGCGGCCGCATTCTCAGTCATCTT3' (SEQ ID No. 32), and cloned to 
the BamHI and NotI sites of pcDNA3.1/Zeo( + ) via new BamHI and Notl sites 
10 engineered into the primers to create pDV68. Finally, the complete TPL 

fragment described above was then added to the unique BamHI site of pDV68 ■ * ■ 

to create pDV66.^The nucleotide sequence of pDV69 is listed in SEQ ID No. 33 * M /i« p 

and has been deposited in the 1 ATCC under accession number PTA-1 146. ! ^ k^IV > ^ 

E. ' Preparation of Stable Adenovirus Packaging Cell Lines 

15 E1-2a S8 cells are derivatives of the A549 lung carcinoma line (ATCC # 

CCL 185) with chromosomal insertions of the plasmids pGRE5-2.E1 (also 
referred to as GRE5-E1-SV40-Hygro construct and listed in SEQ ID No. 34) and 
pMNeoE2a-3.1 (also referred to as MMTV-E2a-SV40-Neo construct and listed in 
SEQ ID No. 35), which provide complementation of the adenoviral E1 and E2a 

20 functions, respectively. This line and its derivatives were grown in Richter's 
modified medium (BioWhitaker) + 10% FCS. E1-2a S8 cells were 
electroporated as previously described (Von Seggern etal. (1998) J. Gen ViroL, 
79: 1461-1468) with pDV61, pDV67, or with pDV69, and stable lines were 
selected with zeocin (600//g/ml). 

25 The cell line generated with pDV61 is designated 601. The cell line 

generated with pDV67 is designated 633 while that generated with pDV69 is 
designated 644. Candidate clones were evaluated by immunofluorescent 
staining with a polyclonal antibody raised against the Ad2 fiber. Lines 
expressing the highest level of fiber protein were further characterized. 

30 For the S8 cell complementing cell lines, to induce E1 expression, 0.3 //M 

of dexamethasone was added to cell cultures 1 6-24 hours prior to challenge 
with virus for optimal growth kinetics. For preparing viral plaques, 5 X 10 5 
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celis/well in 6 well plates are prepared and pre-induced with the same 
concentration of dexamethasone the day prior to infection with 0.5 /;M included 
at a final concentration in the agar overlay after infection. 
F. Development of Cell Lines for Complementation of E17E2a* Vectors 
5 The Adenovirus 5 genome was digested with Seal enzyme, separated on 

an agarose gel, and the 6,095 bp fragment containing the left end of the virus 
genome was isolated. The complete Adenovirus 5 genome is registered as 
Genbank accession #M73260, incorporated herein by reference, and the virus is 
available from the American Type Culture Collection, Manassas, Virginia, U.S.A., 

10 under accession number VR-5. The Seal 6,095 bp fragment was digested 

further with Clal at bp 917 and Bglll at bp 3,328. The resulting 2,41 1 bp Clal to 
? Bglll fragment was purified from an agarose gel arid ligated into the superlinker 
shuttle plasmid pSE280 (Invitrogeri, Sar?*DTeg&/i3A r ), which Was digested with 
Clal and Bglll, to form pSE280-E. 

15 Polymerase chain reaction (PCR) was performed to synthesize DNA 

encoding an Xhol and Sail restriction site contiguous with Adenovirus 5 DNA bp 
552 through 924. The primers which were employed were as follows: 
5' end, Ad5 bp 552-585: 

5'-GTCACTCGAGGACTCGGTC-GACTGAAAATGAGACATATTATCTGCCACGGA 
20 CC-3' (SEQ ID No 36) 
3' end, Ad5 bp 922-891: 

5'-CGAGATCGATCACCTCCGGTACAAGGTTTGGCATAG-3' (SEQ ID No. 37) 

This amplified DNA fragment (sometimes hereinafter referred to as 
Fragment A) then was digested with Xhol and Clal, which cleaves at the native 
25 Clal site (bp 917), and ligated to the Xhol and Clal sites of pSE280-E, thus 

reconstituting the 5' end of the E1 region beginning 8 bp upstream of the ATG 
codon. 

PCR then was performed to amplify Adenovirus 5 DNA from bp 3,323 
through 4,090 contiguous with an EcoRI restriction site. The primers which 
30 were employed were as follows: 
5' end, Ad5 bp 3323-3360: 
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5'-CATGAAGATCTGGAAGGTGCTGAGGTACGATGAGACC-3' (SEQ ID No. 38); 
and 

3' end, Ad5 bp 4090-4060: 

5'-GCGACTTAAGCAGTCAGCTG-AGACAGCAAGACACTTGCTTGATCCAAATCC 
5 -3' (SEQ ID No. 39). 

This amplified DNA fragment {sometimes hereinafter referred to as 
Fragment B) was digested with Bglll, thereby cutting at the Adenovirus 5 Bglll 
site (bp 3,382) and EcoRI, and ligated to the Bglll and EcoRI sites of pSE280-AE 
to reconstruct the complete El a and E1b region from Adenovirus 5 bp 552 
10 through 4,090. The resulting plasmid is designated pSE280-E1. 

, A construct containing the intact E1a/b region under the control of the .; 
* ^^synth^tic^promoter GRE5 was prepared as follows. The intact E1a/b regionfvasR 
^exgised from pSE280-E1, which was modified previously to contain a Bamj#fcite^ 
3' to the E1 gene, by digesting with Xhol and BamHI. The Xhol to BamHI f 
15 fragment containing the E1a/b fragment was cloned into the unique Xhol and 
BamHI sites of pGRE5-2/EBV (U.S. Biochemicals, Cleveland, Ohio) to form 
PGRE5-E1). 

Bacterial transformants containing the final construct were identified. 
Plasmid DNA was prepared and purified by banding in CsTFA prior to use for 

20 transfection of cells. 

Construction of plasmid including Adenovirus 5 E2A sequence. 

The Adenovirus 5 genome was digested with BamHI and Spel, which cut 
at bp 21,562 and 27,080, respectively. Fragments were separated on an 
agarose gel and the 5,518 bp BamHt to Spel fragment was isolated. The 5,518 

25 bp BamHI to Spel fragment was digested further with Smal, which cuts at bp 
23,912. The resulting 2,350 bp BamHI to Smal fragment was purified from an 
agarose gel, and ligated into the superlinker shuttle plasmid pSE280, and 
digested with BamHI and Smal to form pSE280-E2 BamHI-Smal. 

PCR then was performed to amplify Adenovirus 5 DNA from the Smal site 

30 at bp 23,912 through 24,730 contiguous with Nhel and EcoRI restriction sites. 
The primers which were employed were as follows: 
5' nd, Ad5 bp 24,732-24,708: 
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5'-CACGAATTCGTCAGCGCTTCTCGTCGCGTCCAAGACCC-3' (SEQ ID No. 40); 
3' end, Ad5 bp 23,91 2-23,934: 

5'-CACCCCGGGGAGGCGGCGGCGACGGGGACGGG-3' (SEQ ID No. 41) 

This amplified DNA fragment was digested with Smal and EcoRI, and 
5 ligated to the Smal and EcoRI sites of pSE280-E2 Bam-Sma to reconstruct the 
complete E2a region from Ad5 bp 24,730 through 21,562. The resulting 
construct is pSE280-E2a. 

In order to convert the BamHI site at the 3' end of E2a to a Sail site, the 
E2a region was excised from pSE280-E2a by cutting with BamHI and Nhel, and 

10 recloned into the unique BamHI and Nhel sites of pSE280. Subsequently, the 
E2a region was excised from this construction with Nhel and Sail in order to 
clone into the Nhel and Sail sites;?of fHe pMAMneo (Clonetech, Palo Alto, CA) 
multiple cloning site in a 5' fc&3£^ri§ntation, respectively. The resulting 
construct is pMAMneo E2a. 

15 Bacterial transformants containing the final pMAMneo-E2a were 

identified. Plasmid DNA was prepared and purified by banding in CsTFA. 
Circular plasmid DNA was linearized at the Xmnl site within the ampicillin 
resistance gene of pMAMneo-E2a, and further purified by the phenol/chloroform 
extraction and ethanol precipitation prior to use for transfection of cells. 

20 Transfection and selection of cells. 

In general, this process involved the sequential introduction, by calcium 
phosphate precipitation, or other means of DNA delivery, of two plasmid 
constructions each with a different viral gene, into a single tissue culture cell. 
The cells were transfected with a first construct and selected for expression of 

25 the associated drug resistance gene to establish stable integrants. Individual cell 
clones were established and assayed for function of the introduced viral gene. 
Appropriate candidate clones then were transfected with a second construct 
including a second viral gene and a second selectable marker. Transfected cells 
then were selected to establish stable integrants of the second construct, and 

30 cell clones were stablished. Cell clones were assayed for functional expression 
of both viral genes. 
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A549 (ATCC Accession No. CCL-185) were used for transfection. 
Appropriate selection conditions were established for G418 and hygromycin B by 
standard kill curve determination. . 

Transfection of A549 cells with plasmids including ET and E2a regions. 
5 pMAMNeo-E2a was linearized with XmnI with the Amp R gene, introduced 

into cells by transfection, and cells were selected for stable integration of this 
plasmid by G418 selection until drug resistant colonies arose. The clones were 
isolated and screened for E2a expression by staining for E2a protein with a 
polyclonal antiserum, and visualizing by immunofluorescence. E2a function was 
10 screened by complementation of the temperature-sensitive mutant Ad5ts1 25 
iyirus which contains a temperature-sensitive mutation in the E2a gene. (Vaii Der 
t fv >t *T, , v 'Vliet, et ai:, J. Virology, Vol. 15, pgs. 348-354 (1975)). Positives clones- ' v ' 
*&&y&yi*r expressing the E2a gene were identified and used for transfection f with the 7 kb 
EcoRV to XmnI fragment from pGRE5-E1, which contains the GRE5 promoted " 
15 E1a/b region plus the hygromycin 8 gene. Cells were selected for hygromycin 
resistance and assayed for E1a/b expression by staining with a monoclonal 
antibody for the ET protein (Oncogene Sciences, Uniondale, N.Y.). E1 function 
was assayed by ability to complement an E1 -deleted vector. At this point, 
expression and function of E2a was verified as described above, thus 
20 establishing the expression of E1a/b and E2a in the positive cell clones. 

A transfected A549 ( A549 (ATCC Accession No. CCL-185);) cell 
lines showed good E1a/b and E2a expression and was selected for further 
characterization. It was designated the S8 cell line. 

G. Preparation of Adenoviral Vectors Containing Ad5./?gal.AF Genome in S8 
25 Improved Fiber-Complementing Cell Lines 

To prepare adenoviral vectors containing Ad5.iffgal.AF (Ad5.yffgal.AF has 

been was deposited the ATCC under accession number VR2636) in S8 cells 

containing alternative forms of TPL for enhancing the expression of fiber 

proteins, the protocol as described in Example 2 for preparing Ad5.yffgal.AF in 

30 21 1 B cells was followed with the exception of pretreatment with 0.3 jjM 

dexamethasone for 24 hours as described above. Thus, viral particles with the 

wildtype Ad5 fiber protein on their surface and containing the fiberless 
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Ad5.jffgal.AF genome were produced in 633 cells. Particles produced in 644 
cells also contained the f iberless Ad5.jffgal.AF genome, but had the chimeric 
5T3H fiber protein, with the Ad3 fiber knob, on their surface. 

Thus, these viral preparations, prepared as described herein are useful for 
5 targeting delivery of the Ad5.jffgal.AF, Ad5.GFP.AF, or other similarly 

constructed fiberless genome with either wild-type or modified fibers. Preferably 
for purposes herein the fibers are from an Ad serotype D virus, more preferably 
from Ad37. 

EXAMPLE 6 

10 Pseudotyping and Infectivity of Recombinant Adenoviral Vectors 

Produced with Improved Fiber-Complementing Cell Lines 

A. Pseudotypting^of Ad5./?gal.AF 

To verify thati^enpviral vectors were produced had altered tropisms, 
viral particles were purified from either 633 or 644 cells and were then Western 
15 blotted and probed with a polyclonal rabbit antibody against the Ad2 fiber 
(which detects the Ad5 and chimeric 5T3H fiber proteins). 

B. Infectivity of Cells with 633 or 644 Generated Virus Particles The 
cell lines, 633 or 644, prepared as described above, were infected with the 
indicated number of particles/cell of Ad5.jffgal.AF and virus particles produced. 

20 Virus was then used to infect selected cell lines, including 21 1B, MRC-5 human 
fibroblasts, A-10 rat aortic endothelial cells, and THP-1 human monocytic cells. 
Unbound virus was removed by washing the cells and the cells were further 
incubated at 37 °C for 48 hours. Cells were then fixed with glutaraldehyde and 
stained with X-gai. The percentage of stained cells was then determined by light 

25 microscopy where all experiments were done in triplicate. 

The results indicated that adenoviral vectors could be retargeted by 
pseudotyping using packaging cell lines expressing different fiber proteins. 
Particles containing either fiber were equally infectious on 21 1 B cells, while 
MRC-5 fibroblasts and THP-1 cells were more readily infected by virus containing 

30 the chimeric fiber. The A-10 rat endothelial cells were more readily infected by 
particles containing the wildtype Ad 5 fiber protein. 
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EXAMPLE 7 

Transient Transcomplementation 

The ability of adenovirus type 5 (Ad5) to deliver therapeutic genes to cells 
is mediated by the' interaction of the adenoviral fiber protein with the 
5 coxsackievirus-adenoviral receptor (CAR). Because a wide-range of cells express 
CAR, it was thought that it would be difficult to use adenoviruses to deliver 
genes to specific cell types. A system for testing modified fiber genes to 
identify tropisms of interest is described in copending U.S. application Serial 
No. 09/482,682 (also filed as International PCT application No. 
10 PCT/USOO/00265 on January 14, 2000). An in vitro system has been 

developed that involves infection of tissue culture cells with a fiber-deleted Ad 
and transient co-transfection with a plasmid directing^ fiber expression. This 
system allows one to produce and evaluate modified ■filters expressed on a viral 
particle. This system can be used to produce therapeutic quantities of 

1 5 adenoviral vectors with modified fiber proteins, with such fibers having a new 
tropism added by insertion of a desired ligand into the fiber gene. These fibers 
may also have the natural tropism (/.e. binding to CAR) ablated. 

Plasmids used were pDV60 and pDV55 were prepared as described 
herein and in U.S. application Serial No. 09/482,682 (also filed as International 

20 PCT application No. PCT/US00/00265 on January 14, 2000). pDV60 is an 
pcDNA3.1 -based expression plasmid that contains the CMV promoter, Ad5 
tripartite leader, an intron, and the Ad5 fiber gene sequence. pDV55 contains 
no fiber gene and serves as the negative control. Ad5.£gal.AF and 21 1 B are 
described above. 293T cells are identical to 293 cells except they express an 

25 integrated SV40 large T antigen gene. HDF cells are human diploid fibroblasts. 
293T cells express CAR and a v integrins; HDF cells express a v integrins but no 
CAR. Transfections with fiber expression plasmids were performed with 
Lipofectamine (GIBCO-BRL) using 20mg DNA and 50ml Lipof ectamine per 1 5cm 
dish. Cells were maintained in DMEM supplemented with 10% fetal bovine 

30 serum. 

The fiber deletion mutation of Ad5.iffgal.AF is complemented in trans by 
passaging virions through 21 1B, a cell line that stably expresses functional Ad5 
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fiber. The present system was designed to complement Ad5.jffgal.AF by 
modified fibers expressed from transfected episomal plasmids in 293T cells. The 
result is a simplified and rapid method to incorporate modified fibers on a viral 
particle containing the Ad5./?gal.AF genome that does not require propagation of 
5 the virus. 

The feasibility of transcomplementation of Ad5.jffgaI.AF with episomal 
fiber-expressing plasmids was demonstrated in the following experiment. 293T 
cells were transfected with one of two plasmids: pDV55, which expresses no 
fiber or pDV60, which expresses wildtype Ad5 fiber. Fiber expression persists 
10 for at least six days. Twenty-four hours after transfection, these cells were 

infected at 2000 particles/cell with Ad5.jffgal.AF passaged through 21 1B cells. 1 1 

Seventyrtwri hours later, a crude viral lysate (CVL) was generated by exposing z^^-l fcw 
the cells-tb-five f reeze-thaw cycles. Viral particles were purified by cesium 
chloride gradient centrifugation. The resulting virions incorporated the fiber 
15 expressed from the episomal plasmid, as confirmed by Western blots performed 
with an antibody specific to the Ad5 fiber. 

Episomal plasmid transcomplementation system is suitable for quickly 
expressing and evaluating the properties of modified fibers in th& context of a 
viral particle. Episomal plasmid transcomplementation will also be of great utility 
20 for quickly evaluating a bank of modified fibers for other binding properties, 

including new tropisms and the ablation of the native tropism. In addition to the 
rapid generation and testing of large numbers of modified fibers, there are other 
advantages to the Ad5.jffgal.AF transcomplementation system in terms of 
production and safety. Episomal plasmid transcomplementation has the inherent 
25 advantage over transcomplementation in that it is not necessary to make a 
stable cell line for every modified fiber for complementation with Ad5.£gal.AF. 
Because the Ad5.jffgal.AF is deleted in E1 , E3 and fiber, there is an additional 
gene deletion, which should render it very suitable for gene therapy. In addition, 
the presence of the fiber gene deletion decreases the opportunity to generate 
30 replication-competent virus via recombination in the packaging cells. A single 
Ad vector preparation can be retargeted to any number of different cell types 
simply by transfecting the cells with the appropriate fiber-expression construct. 
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EXAMPLE 8 

Preparation of Adenoviral Gene Delivery Vectors Containing the Ad37 fiber 
protein 

This example describes construction of packaging cell lines expressing 
5 the Ad37 fiber protein, and their use in generating particles of a fiber-deleted Ad 
vector {such as Ad5.jffgal.AF) containing this fiber protein. The fiber protein is 
attached to the viral capsid by binding to the penton base protein through its N- 
terminus, and the Ad37 fiber was mpdified in order to make its N-terminal 
sequence more closely match that of the Ad5 protein to ensure that it would 
10 efficiently bind the Ad5 penton base in these vectors. 

A. Materials and methods 

Cell lines and wild-type adenovirus. . Human A549 lung carcinoma 
epithelial cells and human Chang C cqnjynctiya!,: cells (American Type Culture 
Collection) were maintained in complete Dulbecco's Modified Eagle Medium 

15 (DMEM) with 10% fetal bovine serum. Wild-type Ad19p and Ad37 (ATCC) were 
propagated in A549 cells and purified by banding on CsCI 2 density gradients as 
previously described (Huang eta/. (1 999) J. ViroL 75:2798-2802). Viral protein 
concentration was determined by the Bio-Rad Protein Assay, and was used to 
calculate the number of viral particles based on the known molecular weight of 

20 Ad2 virions (1 //g = 4 x 10 9 particles). 

B. Construction of the Ad37 fiber expressing cell lines and the recombinant 
Ad37 knob protein. 

1 . Construction of an Expression Plasmid for the Ad37 Fiber Protein 
(pDV80) 

25 The plasmid designated pDV80 (see, SEQ ID No. 42) prepared for 

expression of the Ad37 fiber protein in mammalian cells, uses the same 
regulatory elements as the elements in pDV60, pDV67, and pDV69 to express 
the Ad37 fiber in packaging lines. It was constructed in two steps. 

First, the Ad37 fiber open reading frame was amplified from Ad37 

30 genomic DNA using synthetic oligonucleotide primers, L37: 5' TGT CCT GGA 
TCC AAG ATG AAG CGC GCC CGC CCCAGC GAA GAT GAC TTC 3'{SEQ ID 
NO. 43) and 37FR: 5' AAA CAC GGC GGC CGC TCT TTC ATT CTT G 3' SEQ 
ID NO. 44). L37 contains nucleotides (underlined) that differ from the Ad37 
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genomic sequence in order to add a unique BamHI site (bold) before the start 
codon (italicized) and to create point mutations that make the N-terminal 
sequence of the fiber more closely match the N-terminal sequence of the Ad5 
fiber protein as follows: 
5 Ad37 MSKRLRVEDDFNPVYPY (SEQ ID No. 45) 

KRARPS (SEQ ID No. 46) 

Ad5 MKRARPSEDTFNPVYPY (SEQ ID No. 47) . 

37FR also incorporates a unique /Vof1 site (bold). The PCR product was inserted 
10 into the BamHI and A/of1 sites of pCDNA3.1zeo( + ) (Invitrogen) to create 
PDV78* The correct sequence of the Ad37 fiber protein, including inserted 
changes, was confirmed by sequencing. 
, u JWP Point mutations in the fiber gene in the 705 line, S356 to P356 and 

;ri ^..t: ^ 13^? ^° T362, were discovered by the sequencing. The mutations are^npt in'jfie 
15i .receptor binding domain in Ad37 fiber gene in the 705 ceil line. They' are buried 
in the knob trimer interface. To confirm that the these mutations do not affect 
receptor binding, jhe Ad37 fiber protein with the correct sequence was recloned, 
and 293T cells transfected with the virus and subsequently infected with 
Ad5.GFPAF to produce Ad37 pseudotyped virus. The results were the same as 
20 the results of the experiments with Ad37 pseudotyped virus produced from line 
705 (see, Wu et a/. (2001 ) Virology 273:78-89). 

Second, a 1 .2 kb Bam HIIBgf II fragment containing an adenovirus type 5 
tripartite leader was excised from pDV55 (see EXAMPLE 3) and inserted into the 
Bam H1 site of pDV78 to create pDV80 (SEQ ID No 42). Plasmid pDV80 has 
25 been deposited in the ATCC under accession number PTA-1 147. ( 
2. Construction of the recombinant Ad37 knob protein 
Recombinant Ad37 knob protein containing an N-terminal T7»Tag was 
produced in E. coli using the PET expression system (Novagen). Ad37 fiber DNA 
(GenBank accession number U69132) was PCR amplified from wild-type Ad37 » 
30 genomic DNA using the following primers (SEQ ID Nos. 48 and 49): 
5' GGATCC4 7GGGATACTTGGTAGCA 3' {BamHl site underlined and 
5' GCAACTCG^GTCATTCTTGGGCAATATAGG 3'(Xhol site underlined). 
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The PCR reactions were performed at 94°C (denaturation), 55 °C 
(annealing), 72 °C (extension, 30 cycles) using Tag DNA polymerase (Qiagen). 
The amplified DNA fragments, which contained residues 172 to 365 of the Ad37 
fiber protein with the addition of an N-termina! start codon (italicized), were 
5 purified and subcloned into the pCR-TOPO vector using the TA-Cloning Kit 
(Invitrogen). No replication errors were found by DNA sequencing. Plasrnids 
from cultured transformed colonies were purified and digested with BamHl and 
Xho\. The fragment was inserted into the BamHl and Xho\ sites of the bacterial 
expression vector, pET21a (Novagen), and transformed into (DE3)pLYS S 
10 expression cells (Invitrogen). Colonies were selected for knob expression by 
induction with 1 mM IPTG for four hours at 37 °C and knob expression was 
determined by SDS-PAGE. The. colony displaying highest knob expression was 
used for large-scale knob %xpressibn and induced with 0.5 mM IPTG at 30°C for 
four hours. 

15 The recombinant T7 'Tagged Ad37 knob protein was purified from 

sonicated bacteria using the T7«Tag Affinity Purification Kit as recommended by 
the manufacturer (Novagen). Recovered protein was analyzed for purity by SDS- 
PAGE followed by Coomassie staining or Western blotting with an HRP- 
conjugated cr-T7»Tag monoclonal antibody as described by the manufacturer 

20 (Novagen) or an a-Ad37 fiber rabbit antibody. 

3. Preparation of Cell Lines that Express the Ad37 fiber protein 
Plasmid pDV80 DNA was purified using the Qiagen method and 
electroporated into the adenovirus-complementing cell line E1-2a S8 (see 
Examples herein; see also, Gorziglia etal. (1996) J. Virology 70:4173-4178; 

25 and Von Seggern etal. (1998) J. Gen. Virol. 75:1461-1468). Stable clones were 
selected with 600 //g/ml zeocin (Invitrogen). 

Clones were expanded and were screened for fiber expression by indirect 
immunofluorescence (Von Seggern etal. (1998) J. Gen. Virol. 73:1461-1468) 
using a rabbit polyclonal antibody directed against the Ad37 fiber (or-Ad37 fiber 

30 rabbit antibody) raised by immunizing rabbits with recombinant Ad37 fiber 
protein. Two clones (lines 705 and 731) that expressed the protein at a 
uniformly high level were selected. 
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EXAMPLE 9 

Production of Pseudotyped Ad Vector Particles 

To generate vector particles equipped ('pseudotyped') with the Ad37 
fiber protein, the Ad37 fiber-expressing 705 cells were infected {approximately 
5 1000 particles/ceil) with Ad5.jffgal.AF or with Ad5.GFP.AF. 
Materials and methods 

Ad5./?gal.AF 

The construction of Ad5.0gaI.AF is described in Example 2 (it has been 
deposited on January 15, 1999, with the ATCC as listed above under accession 
10' number VR2636; see also, Von Seggern etaL (1999) J.V/roL 73:1601-1608; 

, -i copending U.S. application Serial No, 09/482,682 filed January 14, 2000, and 
o " also International PCT application No. PCT/US00/00265; filed UarYuary- 14, 
2000). ' ;^Xr-^P:r 

Ad5.GFP.AF ? 
15 Ad5.GFP.AF was constructed by recombination in bacteria using a 

modification of the AdEasy System (see, U.S. Patent No. 5,922,576; see, also 
He eta/. (1998) Proc. Natl. Acad. Sci. U.S.A. 35:2509-2514; the system is 
publicly available from the authors and other sources). 

First, a fiber-deleted genomic plasmid was constructed by removing the 
20 fiber gene from pAdEasy-1 (see, U.S. Patent No. 5,922,576; and He etaL 
(1998) Proc. Natl. Acad. Sci. U.S.A. 35:2509-2514; the AdEasy system and 
vectors are publicly available from He et al. at Johns Hopkins University). 
Plasmid pAdEasy-1 contains the entire Ad5 genome, except for nucleotides 1- 
3,533, which encompass the E1 genes, and nucleotides 28„ 130-30,820, which 
25 encompass the E3 gene. 

Plasmid pDV43 (see Example 2; see, also Von Seggern etaL (1999) 
J.ViroL 73:1601-1608) was digested with Part, the ends blunted by treatment 
with the large fragment of E. coli DNA polymerase and dNTPs, and the product 
re-ligated to produce plasmd pDV76. The resulting plasmid pDV76 is identical to 
30 pDV43 except for loss of the Part site and contains the right end of the Ad5 
genome with E3 and fiber deletions. A 4.23 kb fragment from PDV76 was 
amplified using the oligonucleotide primers (SEQ ID Nos. 50 and 51: 
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S' CGC GCT GAC TCT TAA GGA CTA GTT TC 3', including the unique Spe1 
site in the Ad5 genome (bold); and 5' GCG CTT AAT TAA CAT CAT CAA TAA 
TAT ACC TTA TTT T 3', including a new Pac\ site (bold) adjacent to the right 
Ad5 ITR. Hence the resulting PCR amplified fragment contains nucleotides 
5 27,082 to 35,935 of the Ad5 genome with deletions of nucleotides 28,133 to 
32,743 (the E3 and fiber genes), and was used to replace the corresponding 
Spe\}Pac\ fragment of pAdEasy 1 (see, U.S. Patent No. 5,922,576) to create 
pDV77. 

Second, E. coli strain BJ5183 was electroporated with a mixture of 
10 pDV77 and Pme*[ -linearized pAdTrack as described (U.S. Patent No. 5,922,576; 
He etal. (1998) Proc. Natl. Acad. ScL U.S.A. 35:2509-2514), and DNA was 
isolated from kanamyciri-resistant colonies. The resulting plasmid, pDV83, 
contains a complete Ad5 genome with E1-, E3-, and fiber-deletions with a CMV- 
driven GFP reporter gene inserted at the site of the E1 deletion. The full length 
15 Ad chromosome was isolated by Pacl digestion, and transfected into the El- 
and fiber-complementing 633 cells (Von Seggern et al. (2000) J. ViroL 74:354- 
362). The 633 cells were produced by electroporating pDV67 (SEQ ID No. 30, 
deposited under ATCC accession number PTA-1 145) into the E1-2a S8 cells, 
described above. The recovered virus Ad5.GFP.AF was then plaque purified by 
20 plating on 633 cells and virus stocks were prepared by freeze-thawing cell 
pellets. 

Ad5-pseudotype particle production 

Particles with Ad5 fiber 
Ad5-pseudotyped particles were generated by virus growth in 633 cells, 
25 which express the wild type Ad5 fiber protein. Viral particles were isolated and 
purified over CsCI gradients (Von Seggern etal (1999) J. ViroL 73:1601-1608; 
purified by centrifugation on preformed 1 5-40% CsCI gradients (111 ,000 x g for 
three hours at 4°Q). For analysis of viral proteins, ten jjg of the purified 
particles were electrophoresed on 8-16% gradient gels and the protein 
30 transferred to nylon membranes. The resulting blot was probed with rabbit 
polyclonal antibodi s raised against recombinant Ad37 fiber or Ad5 fiber or 
penton base proteins expressed in baculovirus-infected cells. 
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Particles with Ad37 fiber 

Cells from the Ad37 fiber producing cell line 705 were infected at 
approximately 1000 particles/cell with Ad5.jffgal.AF or with Ad5.GFP.AF. Viral 
particles were isolated and purified over CsCI gradients. The bands were 
5 harvested, dialyzed into storage buffer (10 mM Tris-pH 8.1, 0.9% NaCI, and 
10% glycerol), aliquoted and stored at-70°C. 

Viral protein analyses 

For analysis of viral proteins, 10 pg of purified Ad5.yffgal.AF particles with 
no fiber (grown in 293 cells), the Ad5 fiber (grown in 633 cells), or the Ad37 
10 fiber (grown in 705 cells) were electrophoresed by 8-16% polyacrylamide 

gradient SDS-PAGE and the proteins were transferred to nylon; membranes. The 
blot was then probed with a-Ad37 fiber rabbit antibody- Ad5 fiber and pentbn 
base were detected by reprobing the blot with 'polyclonal' antibodies raised 
against recombinant proteins expressed in baculovirus-infected cells (Wickman et 
15 al. (1993) Cell 2:309-319). 

Adenovirus infection and cell binding assays 

Adherent Chang C and A549 cells were infected with GFP expressing 
Ad5 vectors containing the Ad5 fiber (Ad5.GFP.AF/5F) or the Ad37 fiber 
(Ad5.GFP.AF/37F) at 10,000 particles per ceil for 3 hours at 37 °C, 5% C0 2 in 

20 DMEM, 10% FCS. Cells were washed twice with saline and then cultured 

overnight at 37° C, 5% C0 2 . The next day, the cells were detached with buffer 
containing 0.05% (w/v) trypsin and 0.5 mM EDTA (Boehringer Mannheim) for 5 
minutes at 37 °C. Suspended cells were washed once with PBS and then 
resuspended in phosphate-buffered saline (PBS), pH 7.4. GFP fluorescence was 

25 measured with a FACScan flow cytometer. A threshold established by the 
fluorescence of uninfected cells was used to distinguish cells expressing GFP. 
To assess the role of CAR in Ad infection, 10,000 attached ceils were pre- 
incubated with 1 80 jug/ml RmcB, a function-blocking anti-CAR monoclonal 
antibody (Hsu et al. (1988) J.ViroL 52:1647-1652), in complete DMEM for 1 

30 hour at 4°C. A small volume containing Ad5.GFP.AF/5F or Ad5.GFP.AF/37F 
was then added at 10,000 particles per cell. The cells were infected for 3 
hours, cultured overnight, harvested, and analyzed for GFP expression. Percent 
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cells expressing GFP was determined by the percent of cells detected above a 
threshold set by the fluorescence of uninfected Chang C cells. 

To measure adenovirus binding to cells, wild type Ad37 was labeled with 
125 l using lodogen (Pierce) according to manufacturer instructions and separated 
5 from free t25 l by gel filtration as described (Huang et al. (1999) J. Virol. 

73:2798-2802). Binding of radiolabeled wild type Ad37 on Chang C cells was 
then quantitated as described (Huang et al. (1999) J. Virol. 73:2798-2802). 
Non-specific binding was determined by incubating ceils and labeled Ad37 
particles in the presence of 100-fold concentration of unlabeled Ad37. Specific 

10 binding was calculated by subtracting the non-specific binding from the total 
cpm bound. To examine if divalent cations are required for binding, 10 mM 
ethylenediaminetetraacetic acid (EDTA) or various concentrations of CaCI 2f or' ' 
MgCi$ wereradded to cells before incubation with labeled virus. To examine if-' 
the receptor for Ad37 is a protein, cells were pretreated with 10 //g/ml trypsin 

15 (GIBCO), subtilisin (Sigma), proteinase K (Boehringer-Mannheim), and bromelain 
(Sigman) at 37 °C for 1 hour, then washed twice with complete DMEM before 
adding labeled virus. Cells were >95% viable after protease treatment. 

Ad37 binding to conjunctival cells is calcium-dependent. Specific 125 l- 
labeled Ad37 binding to Chang C cells was measured in the presence of 10 mM 

20 EDTA and in the presence of varying concentrations of calcium chloride or 
magnesium chloride. Specific binding was determined by subtracting the 
nonspecific counts in the presence of 100-fold excess unlabeled virus from the 
total counts. 

Pretreatment of conjunctival cells with proteases inhibits Ad37 binding. 

25 Change C cells were pretreated with various proteases for 1 hour before binding 
125 l-labeled Ad37 to the cells. Nonspecific binding was measured by adding 
100-fold unlabeled Ad37 to cells with 125 l-labeled Ad37 and subtracting from 
total counts for specific binding. Percent inhibition represents the difference in 
specific binding of untreated cells and pretreated cells as a percentage of the 

30 specific binding of untreated cells. 
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Virus overlay protein blot assay (VOPBA) 

For VOPBA of human conjunctival membrane proteins probed with Ad37 
in the presence of EDTA or calcium chloride, Chang C membrane fractions were 
separated by 8% SDS-PAGE and transferred to a PDVF membrane. The 
5 membrane was subsequently probed with or without whole Ad37 particles, a 
polyclonal antibody against Ad37 fiber, and finally a horseradish peroxidase 
conjugated anti-rabbit antibody, in the presence of EDTA or calcium chloride. 
Transferred Chang C membrane proteins were probed with recombinant Ad37 
knob protein, instead of Ad37 knob, in the presence of calcium chloride. 

10 Confluent monolayers of Chang C and A549 cells were detached by 

scraping, pelleted by centrifugation, and then restispended in 250 mM sucrose, 
20 mM HEPES, pH 7.0, 1 mM EDTA; arid 2 /yg/ml aprotinin and leupeptin. Cells 
were transferred into a dounce homogehizer and disrupted with 30 strokes. 
Organelles and nuclei were pelleted at 500g for 15 min. Plasma membrane 

15 fragments were then pelleted from the supernatant of cell lysates at 200,000g 
for 1 hour and then resuspended in 10 mM Tris«CI, pH 8.1, 10 //g/ml aprotinin 
and leupeptin. 

Cell membranes of Chang C or A549 cells were incubated (1:1) with a 
2% SDS, non-reducing buffer and separated on an 8% polyacrylamide gel 

20 without boiling. Membrane proteins were then electroblotted onto a PVDF 
membrane (Immobilon-P) and blocked in 5% (w/v) milk in PBS, pH 7.4, 0.02% 
Tween-20 (PBS-T). After blocking, the membrane was incubated with 1 //g/ml 
wild-type Ad19p or Ad37 in 0.5% (w/v) milk in PBS-T, 1 mM CaCI 2 , for 1 hour 
at room temperature. The membrane was then washed once with phosphate- 

25 buffered saline, pH 7.4 (PBS), 1 mM CaCI 2 , and incubated with 1:500 dilution of 
ff-Ad37 fiber rabbit antibody in 0.5% (w/v) milk in PBS-T, 1mM CaCI 2 , for 30 
minutes at room temperature. The membrane was washed again with PBS, 1 
mM CaCI 2 , and incubated with 1 :5000 dilution of horseradish peroxidase (HRP) 
conjugated ar-rabbit antibody (Sigma) in 0.5% (w/v) milk in PBS-T, 1 mM CaCl 2 , 

30 for 30 minutes at room temperature. The membrane was washed four times in 
PBS, 1 mM CaCI 2 , once with PBS-T, 1 mM CaCI 2 , and once in 1 mM CaCI 2 . The 
blot was developed with enhanced chemiluminescence reagents (Pierce) for 5 
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minutes and placed onto a piece of Biomax film (Kodak) for 5 seconds to 1 

minute. For divalent metal cation experiments, membranes were incubated in 

the presence of 2 mM EDTA instead of 1 mM CaCI 2 in all solutions. To assay 

fiber knob binding to cell membrane proteins, membrane filters were incubated 

5 with 1 /yg/ml purified T7-tagged Ad37 knob protein in Tris-buffered saline, 0.1 % 

Tween-20, 1 mM CaC1 2 , for 1 hour at room temperature. a-Ad37 fiber rabbit 

antibody and HRP-conjugated anti-rabbit antibody were applied and the 

membrane was developed with substrate solution as described above. 

Results: Comparison of adenovirus infection of human conjunctival and lung 
10 epithelial cells with virus particles retargeted with Ad5 or Ad37 fiber proteins 

Packaging cell lines producing the Ad37 fiber protein were generated. 

Since the N-terminal amino acid sequences of the Ad5 and Ad37 fiber proteins " 

differ significantly, and to ensure that the Ad37 fiber would be efficiently - ; 

incorporated into Ad5 vector particles, several residues in the wild-type Ad37 

15 fiber were mutated to more closely match the Ad5 sequence. Stable cell lines 
producing this fiber under control of the CMV promoter and the adenovirus type 
,5 tripartite leader were then generated and screened for fiber expression by 
indirect immunofluorescence. One clone (line 705), which expressed the Ad37 
fiber at a high level, was selected for further study. 

20 Cells from one cell line 633, which expresses the wild-type Ad5 fiber 

protein, and line 705 were infected with a fiber-deleted Ad5 vector carrying a 
jffgalactosidase reporter gene. The resulting vector particles contained the Ad5 
fiber protein (Ad5.jffgai.AF/5F) and the Ad37 fiber protein (Ad5.jffgal.AF/37F), 
respectively. Incorporation of the correct fiber protein into viral particles was 

25 verified by Western blotting. Adenoviral vectors containing the GFP reporter 
gene, Ad5.GFP.AF/5F and Ad5.GFP.AF/37F, were created in the same fashion. 

Infection of a variety of cell types using the retargeted adenovirus 
particles was examined. As assayed by GFP fluorescence, Ad5.GFP.AF/5F 
exhibited good gene delivery to lung epithelial (A549) and conjunctival celis 

30 (Chang C). In contrast, Ad5.GFP.AF/37F efficiently delivered GFP to Chang C 
cells, but exhibited very poor gene delivery to A549 cells. Although CAR is 
expressed on the surface of A549 cells, as indicated by AD5.GFP.AF/5F 
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infection, Ad5.GFP.AF/37F was unable to infect these cells efficiently. This 
experiment shows that the Ad37 fiber protein can confer preferential infection of 
human conjunctival cells, but not CAR-expressing human lung epithelial cells. 

Hence CAR is not the primary receptor for Ad37. Recent studies reported 
5 that expression of CAR on the surface of Chinese hamster ovary (CHO) cells did 
not improve Ad37 binding (Arnberg et al. (2000) J. Virol. 74:42-48), implying 
that Ad37 does not use CAR as a primary receptor. In order to verify this on 
human conjunctival cells, A549 and Chang C cells were pretreated with RmcB 
(Hsu et al. (1 988) J. Virol. 62:1647-1 652), a function-blocking monoclonal 

10 antibody against CAR. The RmcB antibody inhibited infection of A549 cells by 
Ad5.GFP.AF/5F, but it had little effect on infection of Chang C cells by 
Ad5.GFP.AF/37F. This indrcates'that CAR is not the primary receptor for Ad37 
on Chang C conjuncthferf ibells. > 

Ad37 binding to conjunctival cells requires divalent metal cations. It has 

15 been proposed (Roelvink et al. (1998) J.Virol. 72:7909-7915) that a combination 
of fiber binding to CAR and penton base binding to cr v -integrins allows some 
adenovirus serotypes to attach to cells. Although cr v -integrin binding to the RGD 
motif of the adenovirus penton base is of relatively low affinity (Wickmah et al. 
(1993) Cell 2:309-31 9), it may nonetheless contribute to viral attachment to the 

20 cell surface. Ad37 shows a particularly strong affinity for binding to integrin ogff 5 
(Mathias et aL (1998) J. Virol. 72:8669-8675), suggesting that integrin 
might be a primary receptor for Ad37. Binding of the RGD motif by cr v -integrins 
requires the presence of divalent cations, such as calcium or magnesium (Stuiver 
et al. (1996) J. Cell Physiol. 765:521-531). In contrast, no divalent cations were 

25 required for binding in the CAR-Ad12 knob complex (Bewley eta/. (1999) 
Science 256:1579-1583). 

To investigate the potential role of or v -integrins and divalent metal cations 
in Ad37 receptor binding, 125 l-labeled Ad37 binding to Chang C cells was 
examined in the absence or presence of EDTA. EDTA inhibited Ad37 binding to 

30 conjunctival cells but did not alter Ad5 binding. Thes findings suggest a 
requirement for divalent metals for Ad37 binding. 
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The presence of either calcium or magnesium ions helps oJ3 5 organize in focaf 
contacts {Stuiver etaL (1996) J.Cell PhysioL 753:521-531), suggesting that 
calcium and magnesium aid in integrin crji 5 function. To further test the 
potential role of integrin in Ad37 cell attachment, 12S Mabeled Ad37 binding 
5 to Chang C cells was measured in the presence of varying concentrations of 
calcium or magnesium chloride. Magnesium ions had little effect on Ad37 
binding to Chang C cells. In contrast, calcium ions dramatically enhanced Ad37 
binding to Chang C cells. The optimal concentration of calcium chloride for 
Ad37 binding was 1 mM, while higher concentrations of calcium actually 

10 decreased virus binding to cells. The fact that calcium, but not magnesium, 

v s promoted Ad37 attachment is not consistent with integrin qg9 5 as the* primary 
receptor for viral attachment to the cells since either metal wilf support ligand 
binding to integrin ajff 5 . Moreover, A549 cells express ^btiridant cr v -integriris 
(Mathias etaL (1998) J.Virol. 72:8669-8675) but were unable to support 

15 efficient binding of Ad37. 

Wild-type Ad37 particles bind to three conjunctival membrane proteins. 
Recent studies reported that protease treatment of CHO cells abolished Ad37 
binding {Arnberg et al. (2000) J. ViroL 74:42-48), implying that Ad37 bound to 
a protein receptor on CHO cells, Scatchard analysis of Ad37 binding to Chang C 

20 cells showed that each cell expresses approximately 24,000 fiber binding sites 
(Huang et al. (1 999) J. Virol. 73:2798-2802). To determine if the Ad37 binding 
site on human conjunctival cells is also a protein, Chang C cells were treated 
with different proteases prior to measuring binding of 125 I-Iabeled Ad37. 
Digestion of surface proteins by all four proteases inhibited Ad37 binding to 

25 Chang C cells by greater than 50%. This finding showed that Ad37 also binds 
to a protein receptor on Chang C cells. 

Virus overlay protein blot assays (VOPBAs) were used to identify 
candidate viral protein receptors. This Western blot technique uses intact viral 
particles in place of antibodies to probe viral-receptors interactions. VOPBA was 

30 used herein to identify Chang C membrane proteins that bind to Ad37. In the 
absence of Ad37 particles, no protein bands were observed, while addition of 
virus in the absence of calcium revealed binding to a single 45 kDa protein. In 
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the presence of 1 mM calcium chloride, Ad37 reacted with three proteins with 
approximate molecular weights of 45, 50 and 60 kDa. The same three proteins 
were detected using a recombinant Ad37 fiber knob alone, indicating that Ad37- 
receptor interactions are fiber mediated and do not involve interactions of other 
5 capsid proteins such as the penton base. The size of the calcium-independent 
protein (45 kDa) is very similar to the known molecular weight of CAR. A direct 
comparison of the Ad37 VOPBA and a CAR Western biot showed that the 45 
kDa receptor co-migrates with CAR dn SDS-PAGE. Moreover, two other 
members of subgroup D adenoviruses, Ad9 and AD15, have been shown to bind 

10 to CAR (Roelvink etaL (1998) J. Virol. 72:7909-7915). 

Since CAR does not appear to mediate Ad37 binding on intact Chang C 
cells, the possibility- that the 50 or 60 kDa protein serves this function was 
tested by examining an adenovirus serotype that does not bind to Chang C cells. 
Ad19p, a closely related subgroup D adenovirus, binds poorly to Chang C cells 

15 (Huang et al. (1999) J. Virol. 75:2798-2802) and Ad1 9p recognition of the 
Ad37 receptor is therefore unlikely. Ad19p particles bound to the 45 and 60 
kDa receptors in the VOPBA, but did not bind to the 50 kDa receptor. 
Moreover, the 50 kDa receptor is expressed on Chang C cells, but not A549 
cells, which only support low levels of Ad37 binding^ and infection. Taken 
20 together, these data indicate that the 50 kDa protein is a primary candidate 
receptor for Ad37 on human conjunctival cells. 
Discussion 

The identification of the CAR protein as a major adenovirus receptor does 
not explain why certain subgroup D members, such as Ad37, preferentially 

25 infect ocular cells. A 50 kDa human conjunctival cell membrane protein is 

identified herein as a primary candidate for the receptor for Ad37. This 50 kDa 
protein is not present on A549 lung epithelial cells. Ad37 binding to this receptor 
is calcium-dependent, which is consistent with Ad37 binding and infection 
experiments. Ad37 also bound to a 60 kDa protein that is present on human 

30 conjunctival and lung epithelial cells. It does not, however, appear to be 

serotype specific. The molecular weights of MHC class I heavy chain, which has 
been propos d as a receptor for Ad5, and aji z and aj? 5 intergrins, receptors for 
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the penton base, are distinct from the 50 or to kDa receptor characterized in this 
study. 

The studies of Ad37-receptor interaction using VOPBAs are consistent 
with previous studies showing that subgroup D adenoviruses can bind to the 
5 extracellular domain of CAR (Roelvink et aL (1998) J.Virol. 72:7909-7915). 
Biochemical and structural studies on knob-CAR interactions indicate that the 
CAR binding site is located on the AB-loop of the fiber knob. Alignment of the 
fiber sequences of Ad37 and other adenoviruses reveals that the AB-loop of 
Ad37 is similar to those of Ad 1 2 and Ad5. Moreover, a phyiogenetic tree of 
10 adenovirus knobs (Roelvink et aL (1998) J. Virol. 72:7909-7915) shows that 
v fiber proteins of subgroup D are similar to those of subgroup C and E, which use 
=- CAR as their primary receptor. Ad37 does not; however, appear to effectively 
use CAR as a primary receptor, as demonstrated by virus binding and infection 
studies on Chang C conjunctival cells and A549 lung epithelial cells; 

15 It has been reported that Ad37 uses sialic acid as a receptor on Chinese 

hamster ovary (CHO) cells and human lung carcinoma (A549) cells (Arnberg et 
aL ((2000) J. ViroL 74:42-48). Human conjunctival cells were not studied. 
Human corneal epithelial (HCE) cells were the only ocular cell line studied and 
Ad37 binds relatively poorly to these cells, compared to binding on A549 cells 

20 (Arnberg et aL ((2000) J. ViroL 74:42-48). In addition, 8,4 X 10 7 wheat germ 
agglutinin molecules per cell were required to significantly inhibit Ad37 binding 
to sialic acid on sialic acid positive CHO cells (Arnberg et aL (2000) J. ViroL 
74:42-48), three orders of magnitude higher than the number of Ad37 receptors 
on Chang C conjunctival cells (Huang et aL (1999) J. ViroL 73:2798-2802). 

25 Clearly, sialic acid is not the only factor responsible for Ad37 binding to the cell 
surface and its influence on Ad37 tropism is unclear. 

The results herein show that Ad37 selects a 50 kDa cellular receptor for 
binding to conjunctival cells, but it is possible that sialic acid also plays a role in 
this interaction. The characterization and identification of the Ad37 receptor 

30 have therapeutic implications and also explain the different tropism of Ad37. 
The 50 kDA receptor for Ad37 may also be the receptor for other subgroup D 
adenoviruses that cause severe cases of EKC, Ad 19a and Ad8. Ad19p is a 
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nonpathogenic variant of Ad19 {Amberg etaL (1998) Virology 227:239-244} 
while Ad 19 A, along with Ad8 and Ad37, are major causes of EKC. Ad19a and 
Ad37 have identical fiber proteins (Arnberg etaL (1998) Virology 227:239-244) 
and have similar tropism in vivo. Ad8, Ad19a, and Ad37 agglutinate dog and 
5 guinea pig erythrocytes more effectively than four other serotypes that are 
associated with less severe forms of conjunctivitis (Arnberg etaL (1998) 
Virology 227:239-244), implying that the receptors of Ad18, Ad1 9A, and Ad37 
have similar characteristics. Hence,-this 50 kDa receptor is an attractive drug 
target against EKC caused by adenoviruses to provide therapeutic intervention of 

10 ocular diseases associated with these viruses. 

? >o«-. EXAMPLE 10 

Targeting of: the Ad5 vector to photoreceptor cells ; > # v 

i&'vftri '-.The fiber-deleted adenovirus vector Ad5.GFP.AF was propagated in 705. 
ceils, which express a modified Ad37 fiber protein. Viral particles 

15 (Ad5.GFP.Af/37F) were harvested, CsCI-purified and dialized into 0.9% NaCI, 10 
mM Tris, pH 8.1, and 10% glycerol. Two to three /j\ of the resulting solution, 
containing approximately 1 x 10 9 particles///! was injected into the vitreous 
chamber of a mouse eye. Seven days post-injection, eyes were harvested, fixed 
with paraformaldehyde and cryo-sectioned. Sections were stained with an anti- 

20 rhodopsin antibody to identify photoreceptor cells and with DAPI to show all cell 
nuclei. The resulting sections showed red anti-rhodopsin staining in the 
photoreceptors, blue DAPI-stained nuclei, and green GFP staining in any 
transduced cells. The results revealed substantially exclusive transduction of 
photoreceptors. Co-localization of rhodopsin staining and GFP expression 

25 indicated selective transduction of photoreceptor cells. 

As a control, contralateral eyes were injected with a stock of the fiber- 
deleted vector AD5./?gaI.AF grown in the same Ad37 fiber-expressing cells. 
Since this virus (Ad5.£gal.AF/37F) produces £gal rather than GFP, the green 
staining is absent from the photoreceptors. 

30 Additional experiments using the AD37 fiber for targeting to the 

photoreceptor cells have been performed. Subretinal and intra vitreal injection 
have been used in mouse models and the results demonstrate targeting to the 
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photoreceptors. As with intravrtreally injected eyes, the major cell type infectd 
via subretinal administration was the photoreceptor. 

As noted, Ad5.GFP.AF /37F infected Chang C cells efficiently, but A549 
cells poorly. Ad37 fiber protein confers preferential infection on human 
5 conjunctival cells, but not CAR-expressing human lung epithelial cells. Binding 
to conjunctival ceils requires divalent cations. 



Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims: 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid molecule, comprising: 

adenovirus inverted terminal repeat sequences; an adenovirus packaging 

signal operatively linked thereto; and a photoreceptor-specific promoter. 
5 2. The isolated nucleic acid molecule of claim 1, further comprising a 

nucleic acid encoding a therapeutic product operatively linked to the promoter. 

3. The isolated nucleic acid molecule of claim 1, wherein the 
promoter is a rhodopsin promoter, 

4. The nucleic acid molecule of claim 1 , wherein the adenovirus 
10 genome does not encode a functional fiber protein such that packaging the 

nucleic acid requires complementation in a packaging cell. -ru* 
* 5. A recombinant adenovirus vector, comprising the nucleic acid 

molecule of any of claims 1-4 packaged therein. ^ ^ -i ^ 

6. A recombinant adenovirus vector of claim 5, wherein inverted 
1 5 terminal repeat sequences (ITR) and a packaging signal are derived from 

adenovirus type 2 or adenovirus type 5. 

7. A recombinant adenovirus vector of claim 5, wherein the virus 
comprises a fiber protein. 

8. A recombinant adenovirus vector of claim 7, wherein the fiber 
20 protein selectively binds to photoreceptors in the eye of a mammal. 

9. A recombinant adenovirus vector of claim 7, wherein the fiber \s a 
chimera composed of N-terminal sequences from adenovirus type 2 or type 5, 
and a sufficient portion of an adenovirus serotype D fiber for selective binding to 
photoreceptors in the eye of a mammal.. 

25 1 0. A method for targeted delivery of a gene product to the eye of a 

mammal, comprising: 

administering a recombinant adenovirus virus that comprises heterologous 
DNA encoding the gene product or resulting in expression of the gene product, 
wherein the recombinant virus comprises a fiber protein that specifically or 

30 selectively binds to receptors that are expressed on cells in the eye. 
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1 1 . The method of claim 10, wherein the cells are photoreceptors. 

12. The method of claim 10, wherein administration is effected by 
intraocular delivery. 

13. The method of claim 10, wherein administration is effected by a 

5 method selected from subretinal injection, intravenous administration, periorbital 
administration, and intravitreal administration. 

14. The method of claim 10, wherein the recombinant virus comprises 
a fiber protein from an adenovirus type D serotype. 

15. The method of any of claims 10-14, wherein the fiber protein is an 
10 adenovirus type 37. 

16. The method of any of claims 10-14, wherein the fiber is a chimeric 
protein containing a, sufficient portion of the N-terminus of an adenovirus type 2 
or type 5 fiber protein for interaction with an adenovirus type 2 or type 5 
penton, and a sufficient portion of an adenovirus serotype D knob portion of the 

15 fiber for selective binding to photoreceptors in the eye of a mammal. 

17. The method of any of claims 10-16, wherein the recombinant virus 
is an adenovirus type D serotype. 

18. The method of any of claims 10-17, wherein the encapsulated 
nucleic acid comprises a photoreceptor-specific promoter operatively linked to a 

20 nucleic acid comprising the therapeutic product. 

1 9. The method of claim 1 8, wherein the therapeutic product is 
selected from the group consisting of a trophic factor, an anti-apoptotic factor, a 
gene encoding a rhodopsin protein, a wild-type Stargardt disease gene (STDG1), 
an anti-cancer agent and a protein that regulates expression of a photoreceptor- 

25 specific gene product. 

20. The method of any of claims 10-19, wherein delivery is effected 
for treatment of an ocular disease. 

21. The method of claim 20, wherein the disorder is a retinal 
degenerative disease. 

30 22. The method of claim 20, wherein the disease is retinitis 

pigmentosa, Stargardt's disease, diabetic retinopathies, retinal vascularization, or 
retinoblastoma. 
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23. The method of any of claims 10-22, wherein the mammaf is a 
human. 

24. The method of any of claims 10-22, wherein the viral nucleic acid 
comprises: 

5 an adenovirus inverted terminal repeat (ITR) sequences; and 

an adenovirus packaging signal operatively linked thereto. 

25. The method of claim 24, wherein the ITRs and packaging signal 
are derived from an adenovirus serotype B or C. 

26. The method of claim 24, wherein the ITRs and packaging signal 
10 are derived from an adenovirus type 2 or 5. 

27. The method of claim 24, wherein the viral nucleicvacid further 
comprises a photoreceptor-specific promoter. - ; v ^/ ^ % 

28. A method of targeted gene therapy, ^comprising: 
administering a recombinant viral vector that comprises an adenovirus 

15 type 37 fiber protein or portion thereof, whereby the vector selectively 
transduces photoreceptors and delivers a gene product encoded by the 
recombinant viral vector; wherein the portion is sufficient for selective binding to 
photoreceptors. 

29. The method of claim 28, wherein the vector is administered into 
20 the eye. 

30. The method of claim 28, wherein the vector is administered to the 
vitreous cavity of the eye. 

31 . The method of claim 28, wherein administration is effected by 
subretinal injection, intravenous administration, periorbital administration or 

25 intravitreal administration. 

32. The method of any of claims 10-31, wherein at least about 10 7 
plaque forming units of virus are administered. 

33. The method of any of claims 10-31, wherein about 1 plaque 
forming unit to about 10 14 plaque forming units of virus are administered. 
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SEQUENCE LISTING 

<110> VON SEGGERN, DANIEL 
NEMEROW, GLEN R. 
PRIEDLANDER, MARTIN 

<120> VECTORS FOR OCULAR TRANSDUCTION AND USE THEREFOR FOR GENETIC THERAPY 

<130> 756.1PCT/NOV0205P 

<140> 

<141> 2001-05-01 

<150> 09/562,934 
<151> 2000-05-01 

<160> 51 

<170> Patentln Ver. 2.1 
<210> 1 

<211> 30 - -V;v- 

<212> DNA 

<213> Artificial Sequence 
<220> ' c'y ft*': 

<223> Description of Artificial Sequence: primer 
<400> 1 

cggtacacag aattcaggag acacaactcc 30 



<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 2 

gcctggatcc gggaagttac gtaacgtggg aaaac 35 



<210> 3 
<211> 12 
<212> DNA . 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker 
<400> 3 

cgcggatccg eg 12 



<210> 4 
<211> 8710 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: plasmid 
<400> 4 

cacctaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag 60 
ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac 120 
cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa agaacgtgga 180 
ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc 240 
accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga accctaaagg 300 
gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa aggaagggaa 360 
gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc tgcgcgtaac 420 
caccacaccc gccgcgctta atgcgccgct acagggcgcg tcccattcgc cattcaggct 480 
gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa 540 
agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg 600 
ttgtaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggtaccgg 660 
gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcagga gacacaactc 720 
caagtgcata ctctatgtca ttttcatggg actggtctgg ccacaactac attaatgaaa 780 
tatttgccac atcctcttac actttttcat acattgccca agaataaaga atcgtttgtg 840 
ttatgtttca acgtgtttat ttttcaattg cagaaaattt caagtcattt ttcattcagt 900 
agtatagccc caccaccaca tagcttatac agatcaccgt accttaatca aactcacaga 960 
accctagtat tcaacctgcc acctccctcc caacacacag agtacacagt cctttctccc 1020 
cggctggcct taaaaagcat catatcatgg gtaacagaca tattcttagg tgttatattc 1080 
cacacggttt cctgtcgagc caaacgctca tcagtgatat taataaactc cccgggcagc 1140 
tcacttaagt tcatgtcgct gtccagctgc tgagccacag gctgctgtcc aacttgcggt 1200 
tgcttaacgg gcggcgaagg agaagtccac gcctacatgg.; gggtagagtc ataatcgtgc 1260 
atcaggatag ggcggtggtg ctgcagcagc gcgcgaataa actgctgccg ccgccgctcc 1320 
gtcctgcagg aatacaacat ggcagtggtc tcctcagcga tgattcgcac cgcccgcagc 1380 
ataaggcgcc ttgtcctccg ggcacagcag cgcaccctga tctcacttaa atcagcacag 1440 
taactgcagc acagcaccac aatattgttc aaaatcccac agtgcaaggc gctgtatcca 1500 
aagctcatgg cggggaccac agaacccacg tggccatcat accacaagcg caggtagatt 1560 
aagtggcgac ccctcataaa cacgctggac ataaacatta cctcttttgg catgttgtaa 1620 
ttcaccacct cccggtacca tataaacctc tgattaaaca tggcgccatc caccaccatc 1680 
ctaaaccagc tggccaaaac ctgcccgccg gctatacact gcagggaacc gggactggaa 1740 
caatgacagt ggagagccca ggactcgtaa ccatggatca tcatgctcgt catgatatca 1800 
atgttggcac aacacaggca cacgtgcata cacttcctca ggattacaag ctcctcccgc 1860 
gttagaacca tatcccaggg aacaacccat tcctgaatca gcgtaaatcc cacactgcag 1920 
ggaagacctc gcacgtaact cacgttgtgc attgtcaaag tgttacattc gggcagcagc 1980 
ggatgatcct ccagtatggt agcgcgggtt tctgtctcaa aaggaggtag acgatcccta 2040 
ctgtacggag tgcgccgaga caaccgagat cgtgttggtc gtagtgtcat gccaaatgga 2100 
acgccggacg tagtcatatt tcctgaagca aaaccaggtg cgggcgtgac aaacagatct 2160 
gcgtctccgg tctcgccgct tagatcgctc tgtgtagtag ttgtagtata tccactctct 2220 
caaagcatcc aggcgccccc tggcttcggg ttctatgtaa actccttcat gcgccgctgc 2280 
cctgataaca tccaccaccg cagaataagc cacacccagc caacctacac attcgttctg 2340 
cgagtcacac acgggaggag cgggaagagc tggaagaacc atgttttttt ttttattcca 2400 
aaagattatc caaaacctca aaatgaagat ctattaagtg aacgcgctcc cctccggtgg 2460 
cgtggtcaaa ctctacagcc aaagaacaga taatggcatt tgtaagatgt tgcacaatgg 2520 
cttccaaaag gcaaacggcc ctcacgtcca agtggacgta aaggctaaac ccttcagggt 2580 
gaatctcctc tataaacatt ccagcacctt caaccatgcc caaataattc tcatctcgcc 2640 
accttctcaa tatatctcta agcaaatccc gaatattaag tccggccatt gtaaaaatct 2700 
gctccagagc gccctccacc ttcagcctca agcagcgaat catgattgca aaaattcagg 2760 
ttcctcacag acctgtataa gattcaaaag cggaacatta acaaaaatac cgcgatcccg 2820 
taggtccctt cgcagggcca gctgaacata atcgtgcagg tctgcacgga ccagcgcggc 2880 
cacttccccg ccaggaacct tgacaaaaga acccacactg attatgacac gcatactcgg 2940 
agctatgcta accagcgtag ccccgatgta agctttgttg catgggcggc gatataaaat 3000 
gcaaggtgct gctcaaaaaa tcaggcaaag cctcgcgcaa aaaagaaagc acatcgtagt 3060 
catgctcatg cagataaagg caggtaagct ccggaaccac cacagaaaaa gacaccattt 3120 
ttctctcaaa catgtctgcg ggtttctgca taaacacaaa ataaaataac aaaaaaacat 3180 
ttaaacatta gaagcctgtc ttacaacagg aaaaacaacc cttataagca taagacggac 3240 
tacggccatg ccggcgtgac cgtaaaaaaa ctggtcaccg tgattaaaaa gcaccaccga 3300 
cagctcctcg gtcatgtccg gagtcataat gtaagactcg gtaaacacat caggttgatt 3360 
catcggtcag tgctaaaaag cgaccgaaat agcccggggg aatacatacc cgcaggcgta 3420 
gagacaacat tacagccccc ataggaggta taacaaaatt aataggagag aaaaacacat 3480 
aaacacctga aaaaccctcc tgcctaggca aaatagcacc ctcccgctcc agaacaacat 3540 
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acagcgcttc acagcggcag cctaacagtc 
aaaaaacacc actcgacacg gcaccagctc 
gcagagcgag tatatatagg actaaaaaat 
ccagaaaacc gcacgcgaac ctacgcccag 
caaatcgtca cttccgtttt cccacgttac 
tctccgcaag aattgattgg ctccaattct 
ccggcttcca ttcaggtcga ggtggcccgg 
cagacaaggt atagggcggc gcctacaatc 
gcggcataaa tcgccgtgac gatcagcggt 
gcgagcgatc cttgaagctg tccctgatgg 
tgcaacgcgg gcatcccgat gccgccggaa 
cagcctcgcg tcgcgaacgc cagcaagacg 
ttcatccccg tggcccgttg ctcgcgtttg 
catgtcttta gttctatgat gacacaaacc 
aacacgcaga tgcagtcggg gcggcgcggt 
cgtgtggcct cgaacaccga gcgaccctgc 
ccgcagatcc cgggcaatga gatatgaaaa 
agtttctgat cgaaaagttc gacagcgtct 
aatctcgtgc tttcagcttc gatgtaggag 
gcgccgatgg tttctacaaa gatcgttatg 
cgattccgga agtgcttgac attggggaat 
gccgtgcaca - gggtgtcacg ttgcaagacc 
agccggtcgc. :ggaggccatg gatgcgatcg 
- tcggcccatt cggaccgcaa ggaatcggtc 
; cgattgctga tccccatgtg tatcactggc 
\-. ccgtcgcgca ggctctcgat gage t gat gc 
acctcgtgca cgeggattte ggctccaaca 
eggtcattga ctggagcgag gcgatgttcg 
tcttctggag gccgtggttg gcttgtatgg 
ateeggaget tgeaggateg ccgcggctcc 
aactctatca gagcttggtt gaeggcaatt 
gcgacgcaat cgtccgatcc ggagccggga 
gcgcggccgt ctggaccgat ggctgtgtag 
ccagcactcg tccgagggca aaggaatagg 
aggagacaat aceggaagga acccgcgcta 
acgggtgttg ggtcgtttgt teataaaege 
gataccccac cgagacccca ttggggccaa 
caccccccaa gttcgggtga aggeccaggg 
gccatagcca ctggccccgt gggttaggga 
tcgtgggggt tattattttg ggcgttgcgt 
cagacccatg gtttttggat ggcctgggca 
ccgggcgtct gtggctgcca aacacccccg 
cgcccagtgc cgtcgaccgg tcatggctgc 
gccctgacgg gcttgtctgc tcccggcatc 
gagctgeatg tgtcagaggt tttcaccgtc 
taatcageca taccacattt gtagaggttt 
cctgaacctg aaacataaaa tgaatgcaat 
taatggttac aaataaagca atagcatcac 
gcattctagt tgtggtttgt ccaaactcat 
' gttctagagc ggccgccacc gcggtggagc 
atttcgagct tggegtaate atggtcatag 
acaattccac acaacatacg ageeggaage 
gtgagctaac tcacattaat tgcgttgcge 
tcgtgccagc tgcattaatg aatcggccaa 
cgctcttccg cttcctcgct cactgactcg 
gtatcagctc actcaaaggc ggtaataegg 
aagaacatgt gagcaaaagg ccagcaaaag 
gcgtttttcc ataggctccg cccccctgac 
aggtggcgaa acccgacagg actataaaga 
gtgcgctctc ctgttccgac cctgccgctt 
ggaagcgtgg cgctttctca tagctcacgc 
cgctccaagc tgggctgtgt gcacgaaccc 



agccttacca gtaaaaaaga aaacctatta 3600 
aatcagtcac agtgtaaaaa agggecaagt 3660 
gaegtaaegg ttaaagtcca caaaaaacac 3720 
aaacgaaagc caaaaaaccc acaacttcct 3780 
gtaacttccc ggatccgegg cattcacagt 3840 
tggagtggtg aatccgttag cgaggtgccg 3900 
ctccatgcac cgcgacgcaa cgeggggagg 3960 
catgccaacc cgttccatgt gctcgccgag 4020 
ccagtgatcg aagttaggct ggtaagagee 4080 
tegtcatcta cctgcctgga cagcatggcc 4140 
gcgagaagaa tcataatggg gaaggecate 4200 
tagcccagcg cgtcggccgc catgccctgc 4260 
ctggcggtgt ccccggaaga aatatatttg 4320 
ccgcccagcg tcttgtcatt ggegaatteg 4380 
cccaggtcca ettegcatat taaggtgacg 4440 
agcgacccgc ttaacagcgt caacagegtg 4500 
agectgaact caccgcgacg tctgtcgaga 4560 
ccgacctgat gcagctctcg gagggegaag 4620 
ggcgtggata tgtcctgegg gtaaatagct .4680- . 
tttateggea etttgeateg gccgcgctcc '4740 
teagegagag cctgacctat tgcatctccc 4800 
tgectgaaac cgaactgccc gctgttctgc 4860 >: 
ctgcggccga tettagecag aegagegggt 492£. 
aatacactac atggcgtgat ttcatatgcg;. 49 ao. 
aaactgtgat ggacgacacc gtcagtgcgtksb^Q;-- v 
tttgggccga ggactgcccc gaagt coggers X-0O 
atgtcctgac ggacaatggc cgcataacaguSieo* 
gggattccca atacgaggtc gccaacatct 5220 
agcagcagac gcgctacttc gageggagge 52 80 
gggegtatat gctccgcatt ggtcttgacc 5340 
tegatgatge agcttgggcg cagggtcgat 5400 
ctgtcgggcg tacacaaatc gcccgcagaa 5460 
aagtactege cgatagtgga aaccgacgcc 5520 
ggagatgggg gaggctaact gaaacacgga 5580 
tgacggcaat aaaaagacag aataaaaege 5640 
ggggttcggt cccagggctg gcactctgtc 5700 
tacgcccgcg tttcttcctt ttccccaccc 5760 
ctcgcagcca aegtegggge ggcaggccct 5820 
cggggtcccc catggggaat ggtttatggt 5880 
ggggtctggt ccacgactgg actgagcaga 5940 
tggacegcat gtactggcgc gacacgaaca 6000 
acccccaaaa accaccgcgc ggatttctgg 6060 
gccccgacac ccgccaacac ccgctgacgc 6120 
cgcttacaga caagctgtga ccgtctccgg 6180 
atcaccgaaa cgcgcgaggc ageeggatea 6240 
tacttgettt aaaaaacctc cccacctccc 6300 
tgttgttgtt aacttgttta ttgeagctta 6360 
aaatttcaca aataaagcat ttttttcact 6420 
caatgtatct tatcatgtct ggatccacta 6480 
tccagctttt gttcccttta gtgagggtta 6540 
ctgtttcctg tgtgaaattg ttatccgctc 6600 
ataaagtgta aagcctgggg tgcctaatga 6660 
tcactgcccg ctttccagtc gggaaacctg 6720 
cgcgcgggga gaggcggttt gcgtattggg 6780 
ctgcgctcgg tegttegget gcggcgagcg 6840 
ttatccacag aabcagggga taacgcagga 6900 
gccaggaacc gtaaaaaggc cgcgttgctg 6960 
gagcatcaca aaaatcgacg ctcaagtcag 7020 
taccaggegt ttccccctgg aagctccctc 7080 
accggatacc tgtccgcctt tctcccttcg 7140 
tgtaggtatc tcagttcggt gtaggtcgtt 7200 
cccgttcagc ccgaccgctg cgccttatcc 7260 
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ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 7320 
actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 7380 
tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 7440 
gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 7500 
ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 7560 
cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 7620 
ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 7680 
tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 7740 
agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 7800 
gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 7860 
ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 7920 
gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 7980 
cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 8040 
acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 8100 
cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 8160 
cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 8220 
ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 8280 
tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 8340 
atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 8400 
tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 8460 
actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 8520 
aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 8580 
ctcatactct tcctttttca atattattga.agcatttatc agggttattg tctcatgagc 8640 
ggatacatat ttgaatgtat ttagaaaaat^ aaacaaatag gggttccgcg cacatttccc 8700 
cgaaaagtgc z : . :c x :::c^ij^ "■ * 8710 



<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 5 

atgggatcca agatgaagcg cgcaagaccg 30 

<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description 'of Artificial Sequence: primer 
<400> 6 

cataacgcgg ccgcttcttt attcttgggc 30 

<210> 7 
<211> 7148 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 7 

gacggatcgg gagatctccc .gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
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ttagggttag gcgttttgcg ctgcttcgcg 
gattattgac tagttattaa tagtaatcaa 
tggagttccg cgttacataa cttacggtaa 
cccgcccatt gacgtcaata atgacgtatg 
attgacgtca atgggtggac tatttacggt 
atcatatgcc aagtacgccc cctattgacg 
atgcccagta catgacctta tgggactttc 
tcgctattac catggtgatg cggttttggc 
actcacgggg atttccaagt ctccacccca 
aaaatcaacg ggactttcca aaatgtcgta 
gtaggcgtgt acggtgggag gtctatataa 
ctgcttactg gcttatcgaa attaatacga 
gagctcggat ccaagatgaa gcgcgcaaga 
ccatatgaca cggaaaccgg tcctccaact 
cccaatgggt ttcaagagag tccccctggg 
gttacctcca atggcatgct tgcgctcaaa 
ggcaacctta cctcccaaaa tgtaaccact 
aacataaacc tggaaatatc tgcacccctc 
gccgccgcac ctctaatggt cgcgggcaac 
accgtgcacg actccaaact tagcattgcc 
aagctagccc tgcaaacatc aggccccctc 
r actgcctcac cccctctaac tactgccact 
'J atttatacac aaaatggaaa actaggacta 
gacctaaaca ctttgaccgt agcaactggt 
caaactaaag ttactggagc cttgggtttt 
gcaggaggac taaggattga ttctcaaaac 
tttgatgctc aaaaccaact aaatctaaga 
gcccacaact tggatattaa ctacaacaaa 
tccaaaaagc ttgaggttaa cctaagcact 
atagccatta atgcaggaga tgggcttgaa 
cccctcaaaa caaaaattgg ccatggccta 
aaactaggaa ctggccttag ttttgacagc 
aatgataagc taactttgtg gaccacacca 
gagaaagatg ctaaactcac tttggtctta 
gtttcagttt tggctgttaa aggcagtttg 
catcttatta taagatttga cgaaaatgga 
gaatattgga actttagaaa tggagatctt 
ggatttatgc ctaacctatc agcttatcca 
attgtcagtc aagtttactt aaacggagac 
ctaaacggta cacaggaaac aggagacaca 
tgggactggt ctggccacaa ctacattaat 
tcatacattg cccaagaata aagaagcggc 
ctatagtgtc acctaaatgc tagagctcgc 
cagccatctg ttgtttgccc ctcccccgtg 
actgtccttt cctaataaaa tgaggaaatt 
attctggggg gtggggtggg gcaggacagc 
catgctgggg atgcggtggg ctctatggct 
agggggtatc cccacgcgcc ctgtagcggc 
cgcagcgtga ccgctacact tgccagcgcc 
tcctttctcg ccacgttcgc cggctttccc 
gggttccgat ttagtgcttt acggcacctc 
tcacgtagtg ggccatcgcc ctgatagacg 
ttctttaata gtggactctt gttccaaact 
tcttttgatt tataagggat tttggggatt 
taacaaaaat ttaacgcgaa ttaattctgt 
ccccaggctc cccaggcagg cagaagtatg 
aggtgtggaa agtccccagg ctccccagca 
tagtcagcaa ccatagtccc gcccctaact 
tccgcccatt ctccgcccca tggctgacta 
gcctctgcct ctgagctatt ccagaagtag 
tgcaaaaagc tcccgggagc ttgtatatcc 
ggatcgtttc gcatgattga acaagatgga 



atgtacgggc cagatatacg cgttgacatt 240 
ttacggggtc attagttcat agcccatata 300 
atggcccgcc tggctgaccg cccaacgacc 360 
ttcccatagt aacgccaata gggactttcc 420 
aaactgccca cttggcagta catcaagtgt 480 
tcaatgacgg taaatggccc gcctggcatt 540 
ctacttggca gtacatctac gtattagtca 600 
agtacatcaa tgggcgtgga tagcggtttg 660 
ttgacgtcaa tgggagtttg ttttggcacc 720 
acaactccgc cccattgacg caaatgggcg 780 
gcagagctct ctggctaact agagaaccca 840 
ctcactatag ggagacccaa gcttggtacc 900 
ccgtctgaag ataccttcaa ccccgtgtat 960 
gtgccttttc ttactcctcc ctttgtatcc 1020 
gtactctctt tgcgcctatc cgaacctcta 1080 
atgggcaacg gcctctctct ggacgaggcc 114 0 
gtgagcccac ctctcaaaaa aaccaagtca 1200 
acagttacct cagaagccct aactgtggct 1260 
acactcacca tgcaatcaca ggccccgcta 1320 
acccaaggac ccctcacagt gtcagaagga 1380 
accaccaccg atagcagtac ccttactatc 1440 
ggtagcttgg gcattgactt gaaagaggcc 1500 
aagtacgggg ctcctttgca tgtaacagac 1560 
ccaggtgtga ctattaataa-.taettccttg 1620 
gattcacaag gcaatatgca /acttaatgta.y.1680: 
agacgcctta tacttgatgt tagtitatccg 174 Q 
ctaggacagg gccctctttt tataaactca -1800 , 
ggcctttact tgtttacagc ttcaaacaat 1860 
gccaaggggt tgatgtttga cgctacagcc 1920 = 
tttggttcac ctaatgcacc aaacacaaat 1980 
gaatttgatt caaacaaggc tatggttcct 2040 
acaggtgcca ttacagtagg aaacaaaaat 2100 
gctccatctc ctaactgtag actaaatgca 2160 
acaaaatgtg gcagtcaaat acttgctaca 2220 
gctccaatat ctggaacagt tcaaagtgct 2280 
gtgctactaa acaattcctt cctggaccca 2340 
actgaaggca cagcctatac aaacgctgtt 2400 
aaatctcacg gtaaaactgc caaaagtaac 2460 
aaaactaaac ctgtaacact aaccattaca 2520 
actccaagtg catactctat gtcattttca 2580 
gaaatatttg ccacatcctc ttacactttt 2640 
cgctcgagca tgcatctaga gggccctatt 2700 
tgatcagcct cgactgtgcc ttctagttgc 2760 
ccttccttga ccctggaagg tgccactccc 2820 
gcatcgcatt gtctgagtag gtgtcattct 2880 
aagggggagg attgggaaga caatagcagg 2940 
tctgaggcgg aaagaaccag ctggggctct 3000 
gcattaagcg cggcgggtgt ggtggttacg 3 060 
ctagcgcccg ctcctttcgc tttcttccct 3120 
cgtcaagctc taaatcgggg catcccttta 3180 
gaccccaaaa aacttgatta gggtgatggt 3240 
gtttttcgcc ctttgacgtt ggagtccacg 3300 
ggaacaacac tcaaccctat ctcggtctat 3360 
tcggcctatt ggttaaaaaa tgagctgatt 3420 
ggaatgtgtg tcagttaggg tgtggaaagt 3480 
caaagcatgc atctcaatta gtcagcaacc 3540 
ggcagaagta tgcaaagcat gcatctcaat 3600 
ccgcccatcc cgcccctaac tccgcccagt 3660 
atttttttta tttatgcaga ggccgaggcc 3720 
tgaggaggct tttttggagg cctaggcttt 3780 
attttcggat ctgatcaaga gacaggatga 3840 
ttgcacgcag gttctccggc cgcttgggtg 3900 
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gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg 3960 
ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc 4020 
ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct 4080 
tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa 4140 
gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg 4200 
gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa 4260 
gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat 4320 
gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg 4380 
cgcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc 4440 
atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac 4500 
cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg 4560 
gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc 4620 
tatcgccttc' ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag 4680 
cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggttgg 4740 
gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatcfecatgc 4800 
tggagttctt cgcccacccc aacttgttta ttgcagctta taatggttac aaataaagca 4860 
atagcatcac aaatttcaca aataaagcat ttttttcact gcattctagt tgtggtttgt 4920 
ccaaactcat caatgtatct tatcatgtct gtataccgtc gacctctagc tagagcttgg 4980 
cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca 5040 
acatacgagc cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca 5100 
cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc 5160 
attaatgaat cggccaacgc gqggggagag gcggtttgcg tattgggcgc tcttccgctt 5220 
cctcgctcac tgactcgqtg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact 5280 
caaaggcggt aafeacggtta ..tccacagaat caggggataa cgcaggaaag aacatgtgag 5340 
caaaaggcca gcaaaaggec-aggaaccgta aaaaggccgc gttgctggcg tttttccata 5400 
ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc 5460 
cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg 5520 
ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc 5580 
tttctcaatg ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg 5640 
gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc 5700 
ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga 5760 
ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg 5820 
gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt accttcggaa 5880 
aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg 5940 
tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt 6000 
ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg gtcatgagat 6060 
tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct 6120 
aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt gaggcaccta 6180 
tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa 6240 
ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg cgagacccac 6300 
gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc gagcgcagaa 6360 
gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg gaagctagag 6420 
taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca ggcatcgtgg 6480 
tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga tcaaggcgag 6540 
ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct ccgatcgttg 6600 
tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg cataattctc 6660 
ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca accaagtcat 6720 
tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata cgggataata 6780 
ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa 6840 
aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact cgtgcaccca 6900 
actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa acaggaaggc 6960 
aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc atactcttcc 7020 
tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg 7080 
aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac 7140 
ctgacgtc 7148 



<210> 8 
<211> 7469 
<212> DHA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 8 

gacggatcgg 

ccgcatagtt 

cgagcaaaat 

ttagggttag 

gattattgac 

tggagttccg 

cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 

gtaggcgtgt 

ctgcttactg 

gagctcggat 

gtctttccag 

gggacctgag 

agtcacagtc 

tgtttctggc 

tggtcgaggt 

gataccttca 

cttactcctc 

ttgcgcctat 

ggcctctctc 

cctctcaaaa 

tcagaagccc 

atgcaatcac 

cccctcacag 

gatagcagta 

ggcattgact ■ 

gctcctttgc 

actattaata 

ggcaatatgc 

atacttgatg 

ggccctcttt 

ttgtttacag 

ttgatgtttg 

cctaatgcac 

-tcaaacaagg 
attacagtag 
cctaactgta 
ggcagtcaaa 
tctggaacag 
aacaattcct 
acagcctata 
ggtaaaactg 

* cctgtaacac 
gcatactcta 
gccacatcct 
atgcatctag 
tcgactgtgc 
accctggaag 
tgtctgagta 
gattgggaag 
gaaagaacca 
gcggcgggtg 
gctcctttcg 



gagatctccc 
aagccagtat 
ttaagctaca 
gcgttttgcg 
tagttattaa 
cgttacataa 
gacgtcaata 
atgggtggac 
aagtacgccc 
catgacctta 
catggtgatg 
atttccaagt 
ggactttcca 
acggtgggag 
gcttatcgaa 
ctgaattcga 
tactcttgga 
cgagtccgca 
gcaaggtagg 
ggaggtgctg 
gaggtgtggc 
accccgtgta 
cctttgtatc 
ccgaacctct 
tggacgaggc 
aaaccaagtc 
taactgtggc 
aggccccgct 
tgtcagaagg 
cccttactat 
tgaaagagcc 
atgtaacaga 
atacttcctt 
aacttaatgt 
ttagttatcc 
ttataaactc 
cttcaaacaa 
acgctacagc 
caaacacaaa 
ctatggttcc 
gaaacaaaaa 
gactaaatgc 
tacttgctac 
ttcaaagtgc 
tcctggaccc 
caaacgctgt 
ccaaaagtaa 
taaccattac 
tgtcattttc 
cttacacttt 
agggccctat 
cttctagttg 
gtgccactcc 
ggtgtcattc 
acaatagcag 
gctggggctc 
tggtggttac 
ctttcttccc 



gatcccctat 
ctgctccctg 
acaaggcaag 
ctgcttcgcg 
tagtaatcaa 
cttacggtaa 
atgacgtatg 
tatttacggt 
cctattgacg 
tgggactttc 
cggttttggc 
ctccacccca 
aaatgtcgta 
gtctatataa 
attaatacga 
gctcgctgtt 
tcggaaaccc 
tcgaccggat 
ctgagcaccg 
ctgatgatgt 
aggcttgaga 
tccatatgac 
ccccaatggg 
agttacctcc 
cggcaacctt 
aaacataaac 
tgccgccgca 
aaccgtgcac 
aaagctagcc 
cactgcctca 
catttataca 
cgacctaaac 
gcaaactaaa 
agcaggagga 
gtttgatgct 
agcccacaac 
ttccaaaaag 
catagccatt 
tcccctcaaa 
taaactagga 
taatgataag 
agagaaagat 
agtttcagtt 
tcatcttatt 
agaatattgg 
tggatttatg 
cattgtcagt 
actaaacggt 
atgggactgg 
ttcatacatt 
tctatagtgt 
ccagccatct 
cactgtcctt 
tattctgggg 
gcatgctggg 
tagggggtat 
gcgcagcgtg 
ttcctttctc 



ggtcgactct 

cttgtgtgtt 

gcttgaccga 

atgtacgggc 

ttacggggtc 

atggcccgcc 

ttcccatagt 

aaactgccca 

tcaatgacgg 

ctacttggca 

agtacatcaa 

ttgacgtcaa 

acaactccgc 

gcagagctct 

ctcactatag 

gggctcgcgg 

gtcggcctcc 

cggaaaacct 

tggcgggcgg 

aattaaagta vi 

tccaagatga;; 

acggaaaccg' 

tttcaagaga 

aatggcatgc 

acctcccaaa 

ctggaaatat 

cctctaatgg 

gactccaaac 

ctgcaaacat 

ccccctctaa 

caaaatggaa 

actttgaccg 

gttactggag 

ctaaggattg 

caaaaccaac 

ttggatatta 

cttgaggtta 

aatgcaggag 

acaaaaattg 

actggcctta 

ctaactttgt 

gctaaactca 

ttggctgtta 

ataagatttg 

aactttagaa 

cctaacctat 

caagtttact 

acacaggaaa 

tctggccaca 

gcccaagaat 

cacctaaatg 

gttgtttgcc 

tcctaataaa 

ggtggggtgg 

gatgcggtgg 

ccccacgcgc 

accgctacac 

gccacgttcg 



cagtacaatc 

ggaggtcgct 

caattgcatg 
cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 
ctggctaact 
ggagacccaa 
ttgaggacaa 
gaacggtact 
ctcgagaaag 
cagcgggtgg 
grgcggtcttg 
agcgcgcaag 
gtcctccaac 
gtccccctgg 
ttgcgctcaa 
atgtaaccac 
ctgcacccct 
tcgcgggcaa 
ttagcattgc 
caggccccct 
ctactgccac 
aactaggact 
tagcaactgg 
ccttgggttt 
attctcaaaa 
taaatctaag 
actacaacaa 
acctaagcac 
atgggcttga 
gccatggcct 
gttttgacag 
ggaccacacc 
ctttggtctt 
aaggcagttt 
acgaaaatgg 
atggagatct 
cagcttatcc 
taaacggaga 
caggagacac 
actacattaa 
aaagaagcgg 
ctagagctcg 
cctcccccgt 
atgaggaaat 
ggcaggacag 
gctctatggc 
cctgtagcgg 
ttgccagcgc 
ccggctttcc 



tgctctgatg 
gagtagtgcg 
aagaatctgc 
cgttgacatt 
agcccatata 
cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 
agagaaccca 
gcttggtacc 
actcttcgcg 
ccgccaccga 
gcgtctaacc 
cggtcggggt 
agacggcgga 
accgtctgaa 
tgtgcctttt 
ggtactctct 
aatgggcaac 
tgtgagccca 
cacagttacc 
cacactcacc 
cacccaagga 
caccaccacc 
tggtagcttg 
aaagtacggg 
tccaggtgtg 
tgattcacaa 
cagacgcctt 
actaggacag 
aggcctttac 
tgccaagggg 
atttggttca 
agaatttgat 
cacaggtgcc 
agctccatct 
aacaaaatgt 
ggctccaata 
agtgctacta 
tactgaaggc 
aaaatctcac 
caaaactaaa 
aactccaagt 
tgaaatattt 
ccgctcgagc 
ctgatcagcc 
gccttccttg 
tgcatcgcat 
caagggggag 
ttctgaggcg 
cgcattaagc 
cctagcgccc 
ccigtcaagct 



60 

120 

180 

2.40 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

.2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 
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ctaaatcggg gcatcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa 3540 
aaacttgatt agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 3600 
cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca 3660 
ctcaacccta tctcggtcta ttcttttgat ttataaggga ttttggggat ttcggcctat 3720 
tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attaattctg tggaatgtgt 3780 
gtcagttagg gtgtggaaag tccccaggct ccccaggcag gcagaagtat gcaaagcatg 3840 
catctcaatt agtcagcaac caggtgtgga aagtccccag gctccccagc aggcagaagt 3900 
atgcaaagca tgcatctcaa ttagtcagca accatagtcc cgcccctaac tccgcccatc 3960 
ccgcccctaa ctccgcccag ttccgcccat tctccgcccc atggctgact aatttttttt 4020 
atttatgcag aggccgaggc cgcctctgcc tctgagctat tccagaagta gtgaggaggc 4080 
ttttttggag gcctaggctt ttgcaaaaag ctcccgggag cttgtatatc cattttcgga 4140 
tctgatcaag agacaggatg aggatcgttt cgcatgattg aacaagatgg attgcacgca 4200 
ggttctccgg ccgcttgggt ggagaggcta ttcggctatg actgggcaca acagacaatc 4260 
ggctgctctg atgccgccgt gttccggctg tcagcgcagg ggcgcccggt tctttttgtc 4320 
aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg aggcagcgcg gctatcgtgg 4380 
ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg 4440 
gactggctgc tattgggcga agtgccgggg caggatctcc tgtcatctca ccttgctcct 4500 
gccgagaaag tatccatcat ggctgatgca atgcggcggc tgcatacgct tgatccggct 4560 
acctgcccat tcgaccacca agcgaaacat cgcatcgagc gagcacgtac tcggatggaa 4620 
gccggtcttg tcgatcagga tgatctggac gaagagcatc aggggctcgc gccagccgaa 4680 
ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg atctcgtcgt gacccatggc 4740 ' 
gatgcctgct tg.ccgaatat catggtggaa aatggccgct tttctggatt catcgactgt 4800 
ggccggctgg.gtgtggcgga ccgctatcag gacatagcgt tggctacccg tgatattgct 4860 ■ 
gaagagcfctg^gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat cgccgctccc 4f92tir>*w. r 
gatjtcgcagc],gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc gggactctgg :-?4&&'w4+n. 
ggttcgaaat gaccgaccaa gcgacgccca acctgccatc acgagatttc gattccaccg "Sti^or^vi 
ccgccttcta- tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc tggatgatcc 5100* f 
tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt attgcagctt 5160 
ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac 5220 
tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tgtataccgt 5280 
cgacctctag ctagagcttg gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt 5340 
atccgctcac aattccacac aacatacgag ccggaagcat aaagtgtaaa gcctggggtg 5400 
cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct ttccagtcgg 5460 
gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 5520 
gtattgggcg" ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 5580 
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 5640 
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 5700 
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 5760 
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 5820 
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 5880 
tcccttcggg aagcgtggcg ctttctcaat gctcacgctg taggtatctc agttcggtgt 5940 
aggtcgttcg ctccaagctg. ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 6000 
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 6060 
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 6120. 
tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc 6180 
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 6240 
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 6300 
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 6360 
aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa 6420 
aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 6480 
gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 6540 
gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 6600 
caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 6660 
ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 6720 
attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 6780 
ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 6840 
gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 6900 
ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 6960 
tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 7020 
gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 7080 
cggcgtcaat acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 7140 
gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 7200 
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tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 7260 
ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 7320 
gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 7380 
tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 7440 
catttccccg aaaagtgcca cctgacgtc 7469 

<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

tgcttaagcg gccgcgaagg agaagtcc ^ 28 

<210> 10 

<211> 23 . . 

<212> DNA 

<213> Artificial Sequence , * 

<220> ' . Vw-;^' 

<223> Description of Artificial-; Sequence:' primer 

<4oo> io -^^^ : 

ccgagctagc gactgaaaat gag - ^ ■ { > 23 

<210> 11 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

cctctcgaga gacagcaaga cac 23 

<210> 12 
<211> 11152 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 12 

aagcttgggc agaaatggtt gaactcccga gagtgtccta cacctagggg agaagcagcc 60 
aa 9999ttgt ttcccaccaa ggacgacccg tctgcgcaca aacggatgag cccatcagac 120 
aaagacatat tcattctctg ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 
ggggaagttg cggttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct 240 
gtgcaagatt acaatctaaa caattcggag aactcgacct tcctcctgag gcaaggacca 300 
cagccaactt cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 360 
cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420 
atgttaagaa atgaatcatt atcttttagt actattttta ctcaaattca gaagttagaa 480 
atgggaatag aaaatagaaa gagacgctca acctcaattg aagaacaggt gcaaggacta 540 
ttgaccacag gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac 600 
a 99tggtggc aaccagggac ttatagggga ccttacatct acagaccaac agatgccccc 660 
ttaccatata caggaagata tgacttaaat tgggataggt gggttacagt. caatggctat 720 
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aaagtgttat atagatccct cccttttcgt 
tgtatgttgt ctcaagaaga aaaagacgac 
ctaggaacag gaatgcactt ttggggaaag 
ggactaatag aacattattc tgcaaaaact 
ttggcccaac cttgcggttc ccagggctta 
aacgaggatg tgagacaagt ggtttcctga 
gctctgagtg ttctattttc ctatgttctt 
ttatgtaaac caagatataa aagagtgctg 
cattcacctc ttgtgtgttt gtgtctgttc 
tcactttcca gagggtcccc ccgcagaccc 
gctggcgccc gaacagggac cctcggataa 
tgtttgtctt gtattgtctc tttcttgtct 
ccatagggac caagctagcg actgaaaatg 
accgaagaaa tggccgccag tcttttggac 
cttccacctc ctagccattt tgaaccacct 
acggcccccg aagatcccaa cgaggaggcg 
ttggcggtgc aggaagggat tgacttactc 
ccgcctcacc tttcccggca gcccgagcag 
atgccaaacc ttgtaccgga ggtgatcgat 
agtgacgacg aggatgaaga gggtgaggag 
cacggttgca ggtcttgtca ttatcaccgg 
tcgctttgct atatgaggac ctgtggcatg 
gtgggtgata gagtggtggg tttggtgtgg 
iggtttaaaga attttgtatt gtgatttttt 
agcccgagcc agaaccggag cctgcaagac 
tcctgagacg cccgacatca cctgtgtcta 
actccggtcc ttctaacaca cctcctgaga 
aaccagttgc cgtgagagtt ggtgggcgtc 
ttaacgagcc tgggcaacct ttggacttga 
acctgtgatt gcgtgtgtgg ttaacgcctt 
taaagggtga gataatgttt aacttgcatg 
atataatgcg ccgtgggcta atcttggtta 
tggaagattt ttctgctgtg cgtaacttgc 
tttggaggtt tctgtggggc tcatcccagg 
acaagtggga atttgaagag cttttgaaat 
tgggtcacca ggcgcttttc caagagaagg 
ggcgcgctgc ggctgctgtt gcttttttga 
cccatctgag cggggggtac ctgctggatt 
tgagacacaa gaatcgcctg ctactgttgt 
aggagcagca gcagcagcag gaggaagcca 
acccgagagc cggcctggac cctcgggaat 
agaactgaga cgcattttga caattacaga 
ggagcggggg gcttgtgagg ctacagagga 
cagacaccgt cctgagtgta ttacttttca 
tgatctgctg gcgcagaagt attccataga 
ggatgatttt gaggaggcta ttagggtata 
gtacaagatc agcaaacttg taaatatcag 
cgaggtggag atagatacgg aggatagggt 
gccgggggtg cttggcatgg acggggtggt 
ttttagcggt acggttttcc tggccaatac 
tgggtttaac aatacctgtg tggaagcctg 
ttactgctgc tggaaggggg tggtgtgtcg 
cctctttgaa aggtgtacct tgggtatcct 
tgtggcctcc gactgtggtt gcttcatgct 
catggtatgt ggcaactgcg aggacagggc 
ctgtcacctg ctgaagacca ttcacgtagc 
tgagcataac atactgaccc gctgttcctt 
accttaccaa tgcaatttga gtcacactaa 
ggtgaacctg aacggggtgt ttgacatgac 
tgagacccgc accaggtgca gaccctgcga 
tgtgatgctg gatgtgaccg aggagctgag 
cgctgagttt ggctctagcg atgaagatac 



gaaagactcg ccagagctag acctccttgg 780 
atgaaacaac aggtacatga ttatatttat 840 
attttccata ccaaggaggg gacagtggct 900 
catggcatga gttattatga atagccttta 960 
agtaagtttt tggttacaaa ctgttcttaa 1020 
cttggtttgg tatcaaaggt tctgatctga 1080 
ttggaattta tccaaatctt atgtaaatgc 1140 
attttttgag taaacttgca acagtcctaa 1200 
gccatcccgt ctccgctcgt cacttatcct 1260 
cggcgaccct caggtcggcc gactgcggca 1320 
gtgacccttg tctctatttc tactatttgg 1380 
ggctatcatc acaagagcgg aacggactca 1440 
agacatatta tctgccacgg aggtgttatt 1500 
cagctgatcg aagaggtact ggctgataat 1560 
acccttcacg aactgtatga tttagacgtg 1620 
gtttpgcaga tttttcccga ctctgtaatg 1680 
acttttccgc cggcgcccgg ttctccggag 1'740 
ccggagcaga gagccttggg tccggtttct 1800 
cttacctgcc acgaggctgg ctttccaccc 1860 
tttgtgttag attatgtgga gcaccccggg 1920 
aggaatacgg gggacccaga tattatgtgt IS 80 
tttgtctaca gtaagtgaaa attatgggca -£.p40 
taattttttt tttaattttt acagttttgt *ixQQ 
taaaaggtcc tgtgtctgaa ^cctgagcctg^^ieo 
ctacccgccg tcctaaaatg^gcgcctgcta -2220; ^ > 
gagaatgcaa tagtagtacgrgatagctgtg 2280 
tacacccggt ggtcccgctg tgccccatta 234 0? 
gccaggctgt ggaatgtatc gaggacttgc 2400 
gctgtaaacg ccccaggcca taaggtgtaa 2460 
tgtttgctga atgagttgat gtaagtttaa 2520 
gcgtgttaaa tggggcgggg cttaaagggt 2580 
catctgacct catggaggct tgggagtgtt 2640 
tggaacagag ctctaacagt acctcttggt 2700 
caaagttagt ctgcagaatt aaggaggatt 2760 
cctgtggtga gctgtttgat tctttgaatc 2820 
tcatcaagac tttggatttt tccacaccgg 2880 
gttttataaa ggataaatgg agcgaagaaa 2940 
ttctggccat gcatctgtgg agagcggttg 3000 
cttccgtccg cccggcgata ataccgacgg 3060 
ggcggcggcg gcaggagcag agcccatgga 3120 
gaatgttgta caggtggctg aactgtatcc 3180 
ggatgggcag gggctaaagg gggtaaagag 3240 
ggctaggaat ctagctttta gcttaatgac 3300 
acagatcaag gataattgcg ctaatgagct 3360 
gcagctgacc acttactggc tgcagccagg 3420 
tgcaaaggtg gcacttaggc cagattgcaa 3480 
gaattgttgc tacatttctg ggaacggggc 3540 
ggcctttaga tgtagcatga taaatatgtg 3600 
tattatgaat gtaaggttta ctggccccaa 3660 
caaccttatc ctacacggtg taagcttcta 3720 
gaccgatgta agggttcggg gctgtgcctt 3780 
ccccaaaagc agggcttcaa ttaagaaatg 3840 
gtctgagggt aactccaggg tgcgccacaa 3900 
agtgaaaagc gtggctgtga ttaagcataa 3960 
ctctcagatg ctgacctgct cggacggcaa 4020 
cagccactct cgcaaggcct ggccagtgtt 4080 
gcatttgggt aacaggaggg gggtgttcct 4140 
gatattgctt gagcccgaga gcatgtccaa 4200 
catgaagatc tggaaggtgc tgaggtacga 4260 
gtgtggcggt aaacatatta ggaaccagcc 4320 
gcccgatcac ttggtgctgg cctgcacccg 4380 
agattgaggt actgaaatgt gtgggcgtgg 4440 
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cttaagggtg ggaaagaata tataaggtgg 
agcagccgcc gccgccatga gcaccaactc 
gacaacgcgc atgcccccat gggccggggt 
tggtcgcccc gtcctgcccg caaactctac 
gccgttggag actgcagcct ccgccgccgc 
tgtgactgac tttgctttcc tgagcccgct 
ccgcgatgac aagttgacgg ctcttttggc 
tgtcgtttct cagcagctgt tggatctgcg 
ccctcccaat gcggtttaaa acataaataa 
gcaagtgtct tgctgtctct cgagggatct 
cataattgga caaactacct acagagattt 
gtgtataatg tgttaaacta ctgattctaa 
gaactgatga atgggagcag tggtggaatg 
aagaaatgcc atctagtgat gatgaggcta 
aaaagaagag aaaggtagaa gaccccaagg 
gtcatgctgt gtttagtaat agaactcttg 
aagctgcact gctatacaag aaaattatgg 
ataacagtta taatcataac atactgtttt 
ctattaataa ctatgctcaa aaattgtgta 
ataaggaata tttgatgtat agtgccttga 
gtagaggttt tacttgcttt aaaaaacctc 
atgaatgcaa ttgttgttgt. taacttgtfct 
aatagcatca caaatttcac aaataaagca 
tccaaactca tcaatgtatG-ttatcatgtc 
gggtgtggaa agtccccaggActccccagca 
tagtcagcaa ccaggtgtgg :aaa;gtcccca 
atgcatctca attagtcagc: aaccatagtc 
actccgccca gttccgccca ttctccgccc 
gaggccgagg ccgcctcggc ctctgagcta 
ggcctaggct tttgcaaaaa gcttggacac 
accactttat cccgcgtcag ggagaggcag 
actggttttt agtgcgccag atctctataa 
tttttaagcc gtagataaac aggctgggac 
ctgggacatg ttgcagatcc atgcacgtaa 
atggaaaggc att.attgccg taagccgtgg 
tgaactgggt attcgtcatg tcgataccgt 
gcgcgagctt aaagtgctga aacgcgcaga 
tgacctggtg gataccggtg gtactgcggt 
ctttgtcacc atcttcgcaa aaccggctgg 
tatcccgcaa gatacctgga ttgaacagcc 
aatctccggt cgctaatctt ttcaacgcct 
caggcgggtt acaatagttt ccagtaagta 
acctgagcga aaccctgttc aaaccccgct 
gccgctttaa tcacggcgca caaccgcctg 
ctcactggta tcgcatgatt aaccgtctga 
aaatcctcga cgtccaggca cgtattgtga 
tatacgatac ggtgattggc taccgtggcg 
tttgtgaagg aaccttactt ctgtggtgtg 
taaagctcta aggtaaatat aaaattttta 
attgtttgtg tattttagat tccaacctat 
gcctttaatg aggaaaacct gttttgctca 
actgctgact ctcaacattc tactcctcca 
gactttcctt cagaattgct aagttttttg 
gcttgctttg ctatttacac cacaaaggaa 
gaaaaatatt ctgtaacctt tataagtagg 
tttcttactc cacacaggca tagagtgtct 
acctttagct ttttaatttg taaaggggtt 
actagagatc ataatcagcc ataccacatt 
cccacacctc cccctgaacc tgaaacataa 
tattgcagct tataatggtt acaaataaag 
atttttttca ctgcattcta gttgtggttt 
ctggatcccc aggaagctcc tctgtgtcct 



•gggtcttatg tagttttgta tctgttttgc 4500 
gtttgatgga agcattgtga gctcatattt 4560 
gcgtcagaat gtgatgggct ccagcattga 4620 
taccttgacc tacgagaccg tgtctggaac 4680 
ttcagccgct gcagccaccg cccgcgggat 4740 
tgcaagcagt gcagcttccc gttcatccgc 4800 
acaattggat tctttgaccc gggaacttaa 4860 
ccagcaggtt tctgccctga aggcttcctc 4920 
aaaaccagac tctgtttgga tttggatcaa 4980 
ttgtgaagga accttacttc tgtggtgtga 5040 
aaagctctaa ggtaaatata aaatttttaa 5100 
ttgtttgtgt attttagatt ccaacctatg 5160 
cctttaatga ggaaaacctg ttttgctcag 5220 
ctgctgactc tcaacattct actcctccaa 5280 
actttccttc agaattgcta agttttttga 5340 
cttgctttgc tatttacacc acaaaggaaa 5400 
aaaaatattc tgtaaccttt ataagtaggc 5460 
ttcttactcc acacaggcat agagtgtctg 5520 
cctttagctt tttaatttgt aaaggggtta 5580 
ctagagatca taatcagcca taccacattt 5640 
ccacacctcc ccctgaacct gaaacataaa 5700 
attgcagctt ataatggtta caaataaagc 5760 
tttttttcac tgcattctag ttgtggtttg 5820 
tggatccggc- tgtggaatgt gtgtcagtta 5880 
ggcagaagta tgcaaagcat gcatctcaat 5940 
ggctccccag caggcagaag tatgcaaagc 6000 
j ccgcccctaa ctccgcccat cccgccccta 6060 
catggctgac taattttttt tatttatgca 6120 
ttccagaagt agtgaggagg cttttttgga 6180 
aagacaggct tgcgagatat gtttgagaat 6240 
tgcgtaaaaa gacgcggact, catgtgaaat 6300 
tctcgcgcaa cctattttcc cctcgaacac 6360 
acttcacatg agcgaaaaat acatcgtcac 6420 
actcgcaagc cgactgatgc cttctgaaca 6480 
cggtctggta ccgggtgcgt tactggcgcg 6540 
ttgtatttcc agctacgatc acgacaacca 6600 
aggcgatggc gaaggcttca tcgttattga 6660 
tgcgattcgt gaaatgtatc caaaagcgca 6720 
tcgtccgctg gttgatgact atgttgttga 6780 
gtgggatatg ggcgtcgtat tcgtcccgcc 6840 
ggcactgccg ggcgttgttc tttttaactt 6900 
ttctggaggc tgcatccatg acacaggcaa 6960 
ttaaacatcc tgaaacctcg acgctagtcc 7020 
tgcagtcggc ccttgatggt aaaaccatcc 7080 
tgtggatctg gcgcggcatt gacccacgcg . 7140 
tgagcgatgc cgaacgtacc gacgatgatt 7200 
gcaactggat ttatgagtgg gccccggatc 7260 
acataattgg acaaactacc tacagagatt 7320 
agtgtataat gtgttaaact actgattcta 7380 
ggaactgatg aatgggagca gtggtggaat 7440 
gaagaaatgc catctagtga tgatgaggct 7500 
aaaaagaaga gaaaggtaga agaccccaag 7560 
agtcatgctg tgtttagtaa tagaactctt 7620 
aaagctgcac tgctatacaa gaaaattatg 7680 
cataacagtt ataatcataa catactgttt 7740 
gctattaata actatgctca aaaattgtgt 7.800 
aataaggaat atttgatgta tagtgccttg 7860 
tgtagaggtt ttacttgctt taaaaaacct 7920 
aatgaatgca attgttgttg ttaacttgtt 7980 
caatagcatc acaaatttca caaataaagc 8040 
gtccaaactc atcaatgtat cttatcatgt 8100 
cataaaccct aacctcctct acttgagagg 8160 
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acattccaat cataggctgc ccatccaccc tctgtgtcct cctgttaatt aggtcactta 8220 
acaaaaagga aattgggtag gggtttttca cagaccgctt tctaagggta attttaaaat 8280 
atctgggaag tcccttccac tgctgtgttc cagaagtgtt ggtaaacagc ccacaaatgt 8340 
caacagcaga aacatacaag ctgtcagctt tgcacaaggg cccaacaccc tgctcatcaa 8400 
gaagcactgt ggttgctgtg ttagtaatgt gcaaaacagg aggcacattt tccccacctg 8460 
tgtaggttcc aaaatatcta gtgttttcat ttttacttgg atcaggaacc cagcactcca 8520 
ctggataagc attatcctta tccaaaacag ccttgtggtc agtgttcatc tgctgactgt 8580 
caactgtagc attttttggg gttacagttt gagcaggata tttggtcctg tagtttgcta 8640 
acacaccctg cagctccaaa ggttccccac caacagcaaa aaaatgaaaa tttgaccctt 8700 
gaatgggttt tccagcacca ttttcatgag ttttttgtgt ccctgaatgc aagtttaaca 8760 
tagcagttac- cccaataacc tcagttttaa cagtaacagc ttcccacatc aaaatatttc 8820 
cacaggttaa gtcctcattt aaattaggca aaggaattct tgaagacgaa agggcctcgt 8880 
gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 8940 
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 9000 
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 9060 
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 9120 
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 9180 
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 9240 
ccccgaagaa cgttttccaa tgatgagcac ■ ttttaaagtt ctgctatgtg gcgcggtatt 9300 
atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 9360 
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 9420 
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 9480 
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggat.c atgtaactcg 9540 
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 9600 

^ gatgcctgca gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct ' 9660 
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 9720 

^gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 9780 
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 9840 
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg* 9900 
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 9960 
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 10020 
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 10080 
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 10140 
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 10200 
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 10260 
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 10320 

•gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 10380 
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 10440 
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 10500 
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 10560 
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 10620 
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 10680 
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 10740 
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 10800 
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 10860 
ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat 10920 
atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt atacactccg 10980 
ctatcgctac gtgactgggt catggctgcg ccccgacacc cgccaacacc cgctgacgcg 11040 
ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg 11100 
agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgaggca gc 11152 

<210> 13 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 13 

gacggatcgg gagatctcc 
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<210> 14 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

ccgcctcaga agccatagag cc 22 



<210> 15 
<211> 14455 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 15 

aagcttgggc agaaatggtt;\-gaactcccga gagtgtccta cacctagggg agaagcagcc 60 
aaggggttgt ttcccaccaa v sgacgacccg tctgcgcaca aacggatgag cccatcagac 120 
aaagacatat • tcatt.ct.ctg- ctgcaaactt ggcatagctc tgctttgcct ggggctattg 180 ' ; # 
ggggaagtt:g: : csgttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct 240 * ^jv^?^;^ 
gtgcaagatt :acaatctaaa caattcggag aactcgacct tcctcctgag gcaaggacca 300 i^^&^S 1 ^ 
cagccaactt [cctcttacaa gccgcatcga ttttgtcctt cagaaataga aataagaatg 360 V po" ^ 
cttgctaaaa attatatttt taccaataag accaatccaa taggtagatt attagttact 420 
atgttaagaa atgaatcatt atcttttagt actattttta ctcaaattca gaagttagaa 480 
atgggaatag aaaatagaaa gagacgctca acctcaattg aagaacaggt gcaaggacta 540 
ttgaccacag gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac 600 
aggtggtggc aaccagggac ttatagggga ccttacatct acagaccaac agatgccccc 660 
ttaccatata caggaagata tgacttaaat tgggataggt gggttacagt caatggctat 720 
aaagtgttat atagatccct cccttttcgt gaaagactcg ccagagctag acctccttgg 780 
tgtatgttgt ctcaagaaga aaaagacgac atgaaacaac aggtacatga ttatatttat 8.40 
ctaggaacag gaatgcactt ttggggaaag attttccata ccaaggaggg gacagtggct 900 
ggactaatag aacattattc tgcaaaaact catggcatga gttattatga atagccttta 960 
ttggcccaac cttgcggttc ccagggctta agtaagtttt tggttacaaa ctgttcttaa 1020 
aacgaggatg tgagacaagt ggtttcctga cttggtttgg tatcaaaggt tctgatctga 1080 
gctctgagtg ttctattttc ctatgttctt ttggaattta tccaaatctt atgtaaatgc 1140- 
ttatgtaaac caagatataa aagagtgctg attttttgag taaacttgca acagtcctaa 1200 
cattcacctc ttgtgtgttt gtgtctgttc gccatcccgt ctccgctcgt cacttatcct 1260 
tcactttcca gagggtcccc ccgcagaccc cggcgaccct caggtcggcc gactgcggca 1320 
gctggcgccc gaacagggac cctcggataa gtgacccttg tctctatttc tactatttgg 1380 
tgtttgtctt gtattgtctc tttcttgtct ggctatcatc acaagagcgg aacggactca 1440 
ccatagggac caagctagcg actgaaaatg agacatatta tctgccacgg aggtgttatt 1500 
accgaagaaa tggccgccag tcttttggac cagctgatcg aagaggtact ggctgataat 1560 
cttccacctc ctagccattt tgaaccacct acccttcacg aactgtatga tttagacgtg 1620 
acggcccccg aagatcccaa cgaggaggcg gtttcgcaga tttttcccga ctctgtaatg 1680 
ttggcggtgc aggaagggat tgacttactc acttttccgc cggcgcccgg ttctccggag 1740 
ccgcctcacc tttcccggca gcccgagcag ccggagcaga gagccttggg tccggtttct 1800 
atgccaaacc ttgtaccgga ggtgatcgat cttacctgcc acgaggctgg ctttccaccc 1860 
agtgacgacg aggatgaaga gggtgaggag tttgtgttag attatgtgga gcaccccggg 1920 
cacggttgca ggtcttgtca ttatcaccgg aggaatacgg gggacccaga tattatgtgt 1980 
tcgctttgct atatgaggac ctgtggcatg tttgtctaca gtaagtgaaa attatgggca 2040 
gtgggtgata gagtggtggg tttggtgtgg taattttttt tttaattttt acagttttgt 2100 
ggtttaaaga attttgtatt gtgatttttt taaaaggtcc tgtgtctgaa cctgagcctg 2160 
agcccgagcc agaaccggag cctgcaagac ctacccgccg tcctaaaatg gcgcctgcta 2220 
tcctgagacg cccgacatca cctgtgtcta gagaatgcaa tagtagtacg gatagctgtg 2280 
actccggtcc ttctaacaca cctcctgaga tacacccggt ggtcccgctg tgccccatta 2340 
aaccagttgc cgtgagagtt ggtgggcgtc gccaggctgt ggaatgtatc gaggacttgc 2400 
ttaacgagcc tgggcaacct ttggacttga gctgtaaacg ccccaggcca taaggtgtaa 2460 
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acctgtgatt gcgtgtgtgg ttaacgcctt 
taaagggtga gataatgttt aacttgcatg 
atataatgcg ccgtgggcta atcttggtta 
tggaagattt ttctgctgtg cgtaacttgc 
tttggaggtt tctgtggggc tcatcccagg 
acaagtggga atttgaagag . cttttgaaat 
tgggtcacca ggcgcttttc caagagaagg 
ggcgcgctgc ggctgctgtt gcttttttga 
cccatctgag cggggggtac ctgctggatt 
tgagacacaa gaatcgcctg ctactgttgt 
aggagcagca gcagcagcag gaggaagcca 
acccgagagc cggcctggac cctcgggaat 
agaactgaga cgcattttga caattacaga 
ggagcggggg gcttgtgagg ctacagagga 
cagacaccgt cctgagtgta ttacttttca 
tgatctgctg gcgcagaagt attccataga 
ggatgatttt gaggaggcta ttagggtata 
gtacaagatc agcaaacttg taaatatcag 
cgaggtggag atagatacgg aggatagggt 
gccgggggtg cttggcatgg acggggtggt 
ttttagcggt acggttttcc tggccaatac 
tgggtttaac aatacctgtg tggaagcctg 
ttactgctgc tggaaggggg tggtgtgtcg 
. cctctttgaa aggtgtacct tgggtatcct 
tgtggcctcc gactgtggtt gcttcatgct 
catggtatgt ggcaactgcg aggacagggc 
ctgtcacctg ctgaagacca ttcacgtagc 
tgagcataac atactgaccc gctgttcctt 
accttaccaa tgcaatttga gtcacactaa 
ggtgaacctg aacggggtgt ttgacatgac 
tgagacccgc accaggtgca gaccctgcga 
tgtgatgctg gatgtgaccg aggagctgag 
cgctgagttt ggctctagcg atgaagatac 
cttaagggtg ggaaagaata tataaggtgg 
.agcagccgcc gccgccatga gcaccaactc 
gacaacgcgc atgcccccat gggccggggt 
tggtcgcccc gtcctgcccg caaactctac 
gccgttggag actgcagcct ccgccgccgc 
tgtgactgac tttgctttcc tgagcccgct 
ccgcgatgac aagttgacgg ctcttttggc 
tgtcgtttct cagcagctgt tggatctgcg 
ccctcccaat gcggtttaaa acataaataa 
gcaagtgtct tgctgtctct cgagggatct 
cataattgga caaactacct acagagattt 
gtgtataatg tgttaaacta ctgattctaa 
gaactgatga atgggagcag tggtggaatg 
aagaaatgcc atctagtgat gatgaggcta 
aaaagaagag aaaggtagaa gaccccaagg 
gtcatgctgt gtttagtaat agaactcttg 
aagctgcact gctatacaag aaaattatgg 
ataacagtta taatcataac atactgtttt 
ctattaataa ctatgctcaa aaattgtgta 
ataaggaata tttgatgtat agtgccttga 
gtagaggttt tacttgcttt aaaaaacctc 
atgaatgcaa ttgttgttgt taacttgttt 
aatagcatca caaatttcac aaataaagca 
tccaaactca tcaatgtatc ttatcatgtc 
gggtgtggaa agtccccagg ctccccagca 
tagtcagcaa ccaggtgtgg aaagtcccca 
atgcatctca attagtcagc aaccatagtc 
actccgccca gttccgccca ttctccgccc 
gaggccgagg ccgcctcggc ctctgagcta 



tgtttgctga atgagttgat gtaagtttaa 2520 
gcgtgttaaa tggggcgggg cttaaagggt 2580 
catctgacct catggaggct tgggagtgtt 2640 
tggaacagag ctctaacagt acctcttggt 2700 
caaagttagt ctgcagaatt aaggaggatt 2760 
cctgtggtga gctgtttgat tctttgaatc 2820 
tcatcaagac tttggatttt tccacaccgg 2880 
gttttataaa ggataaatgg agcgaagaaa 2940 
ttctggccat gcatctgtgg agagcggttg 3000 
cttccgtccg cccggcgata ataccgacgg 3060 
ggcggcggcg gcaggagcag agcccatgga 3120 
gaatgttgta caggtggctg aactgtatcc 3180 
ggatgggcag gggctaaagg gggtaaagag 3240 
ggctaggaat ctagctttta gcttaatgac 3300 
acagatcaag gataattgcg ctaatgagct 3360 
gcagqtgacc acttactggc tgcagccagg 3420 
tgcaaaggtg gcacttaggc cagattgcaa 3480 
gaattgttgc tacatttctg ggaacggggc 3540 
ggcctttaga . tgtagcatga taaatatgtg 3600 
tattatgaat gtaaggttta ctggccccaa 3660 
caaccttatc ctacacggtg taagcttcta 3720 
gaccgatgta agggttcggg gctgtgcctt 3780 
ccccaaaago agggcttcaa ttaagaaatg 3840 
gtctgagggt •? aactccaggg tgcgccacaa 3900 
agtgaaaagc-' gtggctgtga. ttaagcataa 3960 
ctctcagatg ctgacctgct cggacggcaa 4020 
cagccactct i'cgcaaggGct • ggccagtgtt 4080 
gcatttgggt aacaggaggg gggtgttcct 4140 
gatattgctt gagcccgaga gcatgtccaa 4200 
catgaagatc tggaaggtgc tgaggtacga 4260 
gtgtggcggt aaacatatta ggaaccagcc 4320 
gcccgatcac ttggtgctgg cctgcacccg 4380 
agattgaggt actgaaatgt gtgggcgtgg 4440 
gggtcttatg tagttttgta tctgttttgc 4500 
gtttgatgga agcattgtga gctcatattt 4560 
gcgtcagaat gtgatgggct ccagcattga 4620 
taccttgacc tacgagaccg tgtctggaac 4680 
ttcagccgct gcagccaccg cccgcgggat 4740 
tgcaagcagt gcagcttccc gttcatccgc 4800 
acaattggat tctttgaccc gggaacttaa 4860 
ccagcaggtt tctgccctga aggcttcctc 4920 
aaaaccagac tctgtttgga tttggatcaa 4980 
ttgtgaagga accttacttc tgtggtgtga 5040 
aaagctctaa ggtaaatata aaatttttaa 5100 
ttgtttgtgt attttagatt ccaacctatg 5160 
cctttaatga ggaaaacctg ttttgctcag 5220 
ctgctgactc tcaacattct actcctccaa 5280 
actttccttc agaattgcta agttttttga 5340 
cttgctttgc tatttacacc acaaaggaaa 5400 
aaaaatattc tgtaaccttt ataagtaggc 5460 
ttcttactcc acacaggcat agagtgtctg 5520 
cctttagctt tttaatttgt aaaggggtta 5580 
ctagagatca taatcagcca taccacattt 5640 
ccacacctcc ccctgaacct gaaacataaa 5700 
attgcagctt ataatggtta caaataaagc 5760 
tttttttcac tgcattctag ttgtggtttg 5820 
tggatccggc tgtggaatgt gtgtcagtta 5880 
ggcagaagta tgcaaagcat gcatctcaat 5940 
ggctccccag caggcagaag tatgcaaagc 6000 
ccgcccctaa ctccgcccat cccgccccta 6060 
catggctgac taattttttt tatttatgca 6120 
ttccagaagt agtgaggagg cttttttgga 6180 
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ggcctaggct tttgcaaaaa gcttggacac aagacaggct tgcgagatat gtttgagaat 6240 
accactttat cccgcgtcag ggagaggcag tgcgtaaaaa gacgcggact catgtgaaat 63 00 
actggttttt agtgcgccag atctctataa tctcgcgcaa cctattttcc cctcgaacac 6360 
tttttaagcc gtagataaac aggctgggac acttcacatg agcgaaaaat acatcgtcac 6420 
ctgggacatg ttgcagatcc atgcacgtaa actcgcaagc cgactgatgc cttctgaaca 6480 > 
atggaaaggc attattgccg taagccgtgg cggtctggta ccgggtgcgt tactggcgcg 6540 
tgaactgggt attcgtcatg tcgataccgt ttgtatttcc agctacgatc acgacaacca 6600 
gcgcgagctt aaagtgctga aacgcgcaga aggcgatggc gaaggcttca tcgttattga 6660 
tgacctggtg gataccggtg gtactgcggt tgcgattcgt gaaatgtatc caaaagcgca 6720 
ctttgtcacc atcttcgcaa aaccggctgg tcgtccgctg gttgatgact atgttgttga 6780 
tatcccgcaa gatacctgga ttgaacagcc gtgggatatg ggcgtcgtat tcgtcccgcc 6840 
aatctccggt cgctaatctt ttcaacgcct ggcactgccg ggcgttgttc tttttaactt 6900 
caggcgggtt acaatagttt ccagtaagta ttctggaggc tgcatccatg acacaggcaa 6960 
acctgagcga aaccctgttc aaaccccgct ttaaacatcc tgaaacctcg acgctagtcc 7020 
gccgctttaa tcacggcgca caaccgcctg tgcagtcggc ccttgatggt aaaaccatcc 70 80 
ctcactggta tcgcatgatt aaccgtctga tgtggatctg gcgcggcatt gacccacgcg 7140 
aaatcctcga cgtccaggca cgtattgtga tgagcgatgc cgaacgtacc gacgatgatt 7200 
tatacgatac ggtgattggc taccgtggcg gcaactggat ttatgagtgg gccccggatc 7260 
tttgtgaagg aaccttactt ctgtggtgtg acataattgg acaaactacc tacagagatt 7320 
taaagctcta aggtaaatat aaaattttta agtgtataat gtgttaaact actgattcta 7380 
attgtttgtg tattttagat tccaacctat ggaactgatg aatgggagca gtggtggaat 7440 
gcctttaatg..aggaaaacct gttttgctca gaagaaatgc catctagtga tgatgaggct 7500 
actgctgact:-ctcaacattc tactcctcca aaaaagaaga gaaaggtaga agaccccaag 7560 ; 
sgactttcctt* cagaattgct aagttttttg agtcatgctg tgtttagtaa tagaactcttyc762*)..'. v*. 
;:gcttgctttg ctatttacac cacaaaggaa aaagctgcac tgctatacaa gaaaattatg^6.8:0-v >. 
vgaaaaatatt ctgtaacctt tataagtagg cataacagtt ataatcataa catactgttt^T^O^ 
•tttcttactc cacacaggca tagagtgtct gctattaata actatgctca aaaattgtgt *-7800:* 
acctttagct ttttaatttg taaaggggtt aataaggaat atttgatgta tagtgccttg 7860 
actagagatc ataatcagcc ataccacatt tgtagaggtt ttacttgctt taaaaaacct 7920 
cccacacctc cccctgaacc tgaaacataa aatgaatgca attgttgttg ttaacttgtt 7980 
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 8040 
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 8100 
ctggatcccc aggaagctcc tctgtgtcct cataaaccct aacctcctct acttgagagg 8160 
acattccaat cataggctgc ccatccaccc tctgtgtcct cctgttaatt aggtcactta 8220 
acaaaaagga aattgggtag gggtttttca cagaccgctt tctaagggta attttaaaat 8280 
atctgggaag tcccttccac tgctgtgttc cagaagtgtt ggtaaacagc ccacaaatgt 8340 
caacagcaga aacatacaag ctgtcagctt tgcacaaggg cccaacaccc tgctcatcaa 8400 
gaagcactgt ggttgctgtg ttagtaatgt gcaaaacagg aggcacattt tccccacctg 8460 
tgtaggttcc aaaatatcta gtgttttcat ttttacttgg atcaggaacc cagcactcca 8520 
ctggataagc attatcctta tccaaaacag ccttgtggtc agtgttcatc tgctgactgt 8580 
caactgtagc attttttggg gttacagttt gagcaggata tttggtcctg tagtttgcta 8640 
acacaccctg cagctccaaa ggttccccac caacagcaaa aaaatgaaaa tttgaccctt 8700 
gaatgggttt tccagcacca ttttcatgag ttttttgtgt ccctgaatgc aagtttaaca 8760 
tagcagttac cccaataacc tcagttttaa cagtaacagc ttcccacatc aaaatatttc 8820 
cacaggttaa gtcctcattt aaattaggca aaggaattct tgaagacgaa agggcctcgt 8880 
gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 8940 
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 9000 
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 9060 
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 9120 
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 9180 
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 9240 
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 9300 
atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 9360 
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 9420 
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 9480 
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 9540 
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 9600 
gatgcctgca gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 9660 
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 9720 
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 9780 
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 9840 
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 9900 
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tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 9960 
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 10020 
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 10080 
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 10140 
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 10200 
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 10260 
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 10320 
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 10380 
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 10440 
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 10500 
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 10560 
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 10620 
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 10680 
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 10740 
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 10800 
agctgatacc gctcgccgca gccgaacgac cgagggcagc gagtcagtga gcgaggaagc 10860 
ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat 10920 
accgcctcag aagccataga gcccaccgca tccccagcat gcctgctatt gtcttcccaa 10980 
tcctccccct tgctgtcctg ccccacccca ccccccagaa tagaatgaca cctactcaga 11040 
caatgcgatg caatttcctc attttattag gaaaggacag tgggagtggc accttccagg 11100 
gtcaaggaag gcacggggga ggggcaaaca acagatggct ggcaactaga aggcacagtc 11160 
gaggctgatc agcgagctct agcatttagg- tgacactata gaatagggcc ctctagatgc 11220 
atgctcgagc ggccgcttct ttattcttgg^gcaatgtatg aaaaagtgta agaggatgtg 11280 
gcaaatattt cattaatgta gttgtggcGa'.'gaccagtccc atgaaaatga catagagtat 11340 
gcacttggag ttgtgtctcc tgt.ttc?:t:gt.;gtaccgttta gtgtaatggt tagtgttaca 11400 
ggtttagttt tgtctccgtt taagtaaafcti- tgactgacaa tgttactttt ggcagtttta 11460 
ccgtgagatt ttggataagc tgataggfcta ggcataaatc caacagcgtt tgtataggct 11520 
gtgccttcag taagatctcc atttctaaag ttccaatatt ctgggtccag gaaggaattg 11580 
tttagtagca ctccattttc gtcaaatctt ataataagat gagcactttg aactgttcca 11640 
gatattggag ccaaactgcc tttaacagcc aaaactgaaa ctgtagcaag tatttgactg 11700 
ccacattttg ttaagaccaa agtgagttta gcatctttct ctgcatttag tctacagtta 11760 
ggagatggag ctggtgtggt ccacaaagtt agcttatcat tatttttgtt tcctactgta 11820 
atggcacctg tgctgtcaaa actaaggcca gttcctagtt taggaaccat agccttgttt 11880 
gaatcaaatt ctaggccatg gccaattttt gttttgaggg gatttgtgtt tggtgcatta 11940 
ggtgaaccaa attcaagccc atctcctgca ttaatggcta tggctgtagc gtcaaacatc 12000 
aaccccttgg cagtgcttag gttaacctca agctttttgg aattgtttga agctgtaaac 12060 
aagtaaaggc ctttgttgta gttaatatcc aagttgtggg ctgagtttat aaaaagaggg 12120 
ccctgtccta gtcttagatt tagttggttt tgagcatcaa acggataact aacatcaagt 12180 
ataaggcgtc tgttttgaga atcaatcctt agtcctcctg ctacattaag ttgcatattg 12240 
ccttgtgaat caaaacccaa ggctccagta actttagttt gcaaggaagt attattaata 12300 
gtcacacctg gaccagttgc tacggtcaaa gtgtttaggt cgtctgttac atgcaaagga 12360 
gccccgtact ttagtcctag ttttccattt tgtgtataaa tgggctcttt caagtcaatg 12420 
cccaagctac cagtggcagt agttagaggg ggtgaggcag tgatagtaag ggtactgcta 12480 
tcggtggtgg tgagggggcc tgatgtttgc agggctagct ttccttctga cactgtgagg 12540 
ggtccttggg tggcaatgct aagtttggag tcgtgcacgg ttagcggggc ctgtgattgc 12600 
atggtgagtg tgttgcccgc gaccattaga ggtgcggcgg cagccacagt tagggcttct 1266Q 
gaggtaactg tgaggggtgc agatatttcc aggtttatgt ttgacttggt ttttttgaga 12720 
ggtgggctca cagtggttac attttgggag gtaaggttgc cggcctcgtc cagagagagg 12780 
ccgttgccca ttttgagcgc aagcatgcca ttggaggtaa ctagaggttc ggataggcgc 12840 
aaagagagta ccccaggggg actctcttga aacccattgg gggatacaaa gggaggagta 12900 
agaaaaggca cagttggagg accggtttcc gtgtcatatg gatacacggg gttgaaggta 12960 
tcttcagacg gtcttgcgcg cttcatcttg gatctcaagc ctgccacacc tcacctcgac 13020 
catccgccgt ctcaagaccg cctactttaa ttacatcatc agcagcacct ccgccagaaa 13080 
caaccccgac cgccacccgc tgccgcccgc cacggtgctc agcctacctt gcgactgtga 13140 
ctggttagac gcctttctcg agaggttttc cgatccggtc gatgcggact cgctcaggtc 13200 
cctcggtggc ggagtaccgt tcggaggccg acgggtttcc gatccaagag tactggaaag 13260 
accgcgaaga gtttgtcctc aaccgcgagc ccaacagcga gctcgaattc agatccgagc 13320 
tcggtaccaa gcttgggtct ccctatagtg agtcgtatta atttcgataa gccagtaagc 13380 
agtgggttct ctagttagcc agagagctct gcttatatag acctcccacc gtacacgcct 13440 
accgcccatt tgcgtcaatg gggcggagtt gttacgacat tttggaaagt cccgttgatt 13500 
ttggtgccaa aacaaactcc cattgacgtc aatggggtgg agacttggaa atccccgtga 13560 
gtcaaaccgc tatccacgcc cattgatgta ctgccaaaac cgcatcacca tggtaatagc 13620 
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gatgactaat acgtagatgt actgccaagt 
ataatgccag gcgggccatt taccgtcatt 
atacacttga tgtactgcca agtgggcagt 
atggaaagtc cctattggcg ttactatggg 
ggggtcgttg ggcggtcagc caggcgggcc 
catatatggg ctatgaacta atgaccccgt 
tcaatgtcaa cgcgtatatc tggcccgtac 
aagcagattc ttcatgcaat tgtcggtcaa 
cgcgcactac tcagcgacct ccaacacaca 
ggcatcagag cagattgtac tgagagtcga 
tctatggtgc actctcagta caatctgctc 
ccgctatcgc tacgtgactg ggtcatggct 
gcgccctgac gggcttgtct gctcccggca 
gggagctgca tgtgtcagag gttttcaccg 



aggaaagtcc cataaggtca tgtactgggc 13680 
gacgtcaata gggggcgtac ttggcatatg 13740 
ttaccgtaaa tagtccaccc attgacgtca 13800 
aacatacgtc attattgacg tcaatgggcg 13860 
atttaccgta agttatgtaa cgcggaactc 13920 
aattgattac tattaataac tagtcaataa 13980 
atcgcgaagc agcgcaaaac gcctaaccct 14040 
gccttgcctt gttgtagctt aaattttgct 14100 
agcagggagc agatactggc ttaactatgc 14160 
ccatagggga tcgggagatc tcccgatccg 14220 
tgatgccgca tagttaagcc agtatacact 14280 
gcgccccgac acccgccaac acccgctgac 14340 
tccgcttaca gacaagctgt gaccgtctcc 14400 
tcatcaccga aacgcgcgag gcagc 14455 



<210> 16 
<211> 10610 
<212> DNA 

<213> Artificial Sequence 

- t . '{<220> ,uX 
; • W"<223> Description of Artificial Sequence: plasmid T ;t;i # 

- <400>- 16^ ■ ^rr+rJfri:^. v - ■ 

^■:v^?v-r^ gacggatcgg gagatccgcg cggtacacag aattcaggag acacaactce'aa'gtgcatac 60 

* ^ tctatgtcat tttcatggga ctggtctggc cacaactaca ttaatgaaat -atttgccaca 120 

tcctcttaca ctttttcata cattgcccaa gaataaagaa tcgtttgtgt tatgtttcaa 180 
cgtgtttatt tttcaattgc agaaaatttc aagtcatttt tcattcagta gtatagcccc 240 
accaccacat agcttataca gatcaccgta ccttaatcaa actcacagaa ccctagtatt 300 
caacctgcca cctccctccc aacacacaga gtacacagtc ctttctcccc ggctggcctt 360 
aaaaagcatc atatcatggg taacagacat attcttaggt gttatattcc acacggtttc 420 
ctgtcgagcc aaacgctcat cagtgatatt aataaactcc ccgggcagct cacttaagtt 480 
catgtcgctg tccagctgct gagccacagg ctgctgtcca acttgcggtt gcttaacggg 540 
cggcgaagga gaagtccacg cctacatggg ggtagagtca taatcgtgca tcaggatagg 600 
gcggtggtgc tgcagcagcg cgcgaataaa ctgctgccgc cgccgctccg tcctgcagga 660 
atacaacatg gcagtggtct cctcagcgat gattcgcacc gcccgcagca taaggcgcct 720 
tgtcctccgg gcacagcagc gcaccctgat ctcacttaaa tcagcacagt aactgcagca 780 
cagcaccaca atattgttca aaatcccaca gtgcaaggcg ctgtatccaa agctcatggc 840 
ggggaccaca gaacccacgt ggccatcata ccacaagcgc aggtagatta agtggcgacc 900 
cctcataaac acgctggaca taaacattac ctcttttggc atgttgtaat tcaccacctc 960 
ccggtaccat ataaacctct gattaaacat ggcgccatcc accaccatcc taaaccagct 1020 
ggccaaaacc tgcccgccgg ctatacactg cagggaaccg ggactggaac aatgacagtg 1080 
gagagcccag gactcgtaac catggatcat catgctcgtc atgatatcaa tgttggcaca 1140 
acacaggcac acgtgcatac acttcctcag gattacaagc tcctcccgcg ttagaaccat 1200 
atcccaggga acaacccatt cctgaatcag cgtaaatccc acactgcagg gaagacctcg 1260 
cacgtaactc acgttgtgca ttgtcaaagt gttacattcg ggcagcagcg gatgatcctc 1320 
cagtatggta gcgcgggttt ctgtctcaaa aggaggtaga cgatccctac tgtacggagt 1380 
gcgccgagac aaccgagatc gtgttggtcg tagtgtcatg ccaaatggaa cgccggacgt 1440 
agtcatattt cctgaagcaa aaccaggtgc gggcgtgaca aacagatctg cgtctccggt 1500 
ctcgccgctt agatcgctct gtgtagtagt tgtagtatat ccactctctc aaagcatcca 1560 
ggcgccccct ggcttcgggt tctatgtaaa ctccttcatg cgccgctgcc ctgataacat 1620 
ccaccaccgc agaataagcc acacccagcc aacctacaca ttcgttctgc gagtcacaca 1680 
cgggaggagc gggaagagct ggaagaacca tgtttttttt tttattccaa aagattatcc 1740 
aaaacctcaa aatgaagatc tattaagtga acgcgctccc ctccggtggc gtggtcaaac 1800 
tctacagcca aagaacagat aatggcattt gtaagatgtt gcacaatggc ttccaaaagg 1860 
caaacggccc tcacgtccaa gtggacgtaa aggctaaacc cttcagggtg aatctcctct 1920 
ataaacattc cagcaccttc aaccatgccc aaataattct catctcgcca ccttctcaat 1980 
atatctctaa gcaaatcccg aatattaagt ccggccattg taaaaatctg ctccagagcg 2040 
ccctccacct tcagcctcaa gcagcgaatc atgattgcaa aaattcaggt tcctcacaga 2100 
cctgtataag attcaaaagc ggaacattaa caaaaatacc gcgatcccgt aggtcccttc 2160 
gcagggccag ctgaacataa tcgtgcaggt ctgcacggac cagcgcggcc acttccccgc 2220 
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caggaacctt gacaaaagaa cccacactga 
ccagcgtagc cccgatgtaa gctttgttgc 
ctcaaaaaat caggcaaagc ctcgcgcaaa 
agataaaggc aggtaagctc cggaaccacc 
atgtctgcgg gtttctgcat aaacacaaaa 
aagcctgtct tacaacagga aaaacaaccc 
cggcgtgacc gtaaaaaaac tggtcaccgt 
tcatgtccgg agtcataatg taagactcgg 
gctaaaaagc gaccgaaata gcccggggga 
acagccccca taggaggtat aacaaaatta 
aaaccctcct gcctaggcaa aatagcaccc 
cagcggcagc ctaacagtca gccttaccag 
ctcgacacgg caccagctca atcagtcaca 
atatatagga ctaaaaaatg acgtaacggt 
cacgcgaacc tacgcccaga aacgaaagcc 
ttccgttttc ccacgttacg taacttcccg 
tcagtacaat ctgctctgat gccgcatagt 
tggaggtcgc tgagtagtgc gcgagcaaaa 
acaattgcat gaagaatctg cttagggtta 
ccagatatac gcgttgacat tgattattga 
cattagttca tagcccatat atggagttcc 
ctggctgacc gcccaacgac . ccccgcccat 
taacgccaat agggactttc cattgacgtc 
acttggcagt acat caagtg. ■ tat cat at gc 
gtaaatggcc cgcctggeat.v tatgcccagt 
agtacatcta eg t at- 1 agt c ; ,-at cgc t at t a 
atgggcgtgg atagcggb.tt gaetcaeggg 
atgggagttt gttttggcac caaaatcaac 
ccccattgac gcaaatgggc ggtaggcgtg 
tctggctaac tagagaaccc actgettact 
gggagaccca agcttggtac egagctegga 
gttgaggaca aactcttcgc ggtctttcca 
egaaeggtae tccgccaccg agggacctga 
tctcgagaaa ggegtctaac cagtcacagt 
geagegggtg gcggtcgggg ttgtttctgg 
aggeggtett gagaeggegg atggtcgagg 
aagcgcgcaa gaeegtctga agataccttc 
ggtcctccaa ctgtgccttt tcttactcct 
agtccccctg gggtactctc tttgegecta 
cttgcgctca aaatgggcaa cggcctctct 
aatgtaacca ctgtgagccc acctctcaaa 
tctgcacccc tcacagttac ctcagaagcc 
gtcgegggea acacactcac catgeaatea 
cttagcattg ccacccaagg acccctcaca 
tcaggccccc tcaccaccac cgatagcagt 
actactgcca ctggtagctt gggcattgac 
aaactaggac taaagtaegg ggctcctttg 
gtagcaactg gtccaggtgt gactattaat 
gccttgggtt ttgattcaca aggcaatatg 
gattctcaaa acagacgcct tatacttgat 
ctaaatctaa gactaggaca gggccctctt 
aactacaaca aaggecttta cttgtttaca 
aacctaagca ctgccaaggg gttgatgttt 
gatgggcttg aatttggttc acctaatgea 
ggccatggcc tagaatttga ttcaaacaag 
agttttgaca gcacaggtgc cattacagta 
tggaccacac cagctccatc tcctaactgt 
actttggtct taacaaaatg tggcagtcaa 
aaaggcagtt tggctccaat atctggaaca 
gacgaaaatg gagtgetact aaacaattcc 
aatggagatc ttactgaagg cacagcctat 
tcagcttatc caaaatctca eggtaaaact 



ttatgacacg catactegga getatgetaa 2280 
atgggcggcg atataaaatg caaggtgctg 2340 
aaagaaagca categtagtc atgctcatgc 2400 
acagaaaaag acaccatttt tctctcaaac 2460 
taaaataaca aaaaaacatt taaacattag 2520 
ttataagcat aagaeggact acggccatgc 2580 
gattaaaaag caccaccgac agctcctcgg 2640 
taaacacatc aggttgattc ateggtcagt 2700 
atacataccc geaggegtag agacaacatt 2760 
ataggagaga aaaacacata aacacctgaa 2820 
tcccgctcca gaacaacata cagcgcttca 2880 
taaaaaagaa aacctattaa aaaaacacca 2940 
gtgtaaaaaa gggccaagtg cagagegagt 3000 
taaagtccac aaaaaacacc cagaaaaccg 3060 
aaaaaaccca caacttcctc aaategtcac 3120 
gatcctctcc cgatccccta tggtcgactc 3180 
taagccagta tctgctccct gcttgtgtgt 3240 
tttaagctac aacaaggcaa ggcttgaccg 3300 
ggcgttttgc gctgcttcgc gatgtacggg 3360 
ctagttatta atagtaatca attaeggggt 3420 
gcgttacata acttaeggta aatggcccgc 3480 
tgacgtcaat aatgaegtat gttcccatag 3540 
aatgggtgga etatttaegg taaactgccc 3600 
caagtacgcc ccctattgac gtcaatgacg 3660 
acatgacctt atgggacttt cctacttggc 3720 
ccatggtgat gcggttttgg cagtacatca 3780 
gatttccaag tctccacccc attgaegtea 3840 
gggactttcc aaaatgtcgt aacaactccg 3900 
tacggtggga ggtctatata agcagagctc 3960 
ggcttatcga aattaatacg actcactata 4020 
tctgaattcg agetegctgt tgggctcgcg 4080 
gtactcttgg ateggaaace cgtcggcctc 4140 
gcgagtccgc ategacegga teggaaaace 4200 
cgcaaggtag gctgagcacc gtggcgggcg 4260 
eggaggtget gctgatgatg taattaaagt 4320 
tgaggtgtgg caggcttgag atccaagatg 4380 
aaccccgtgt atccatatga cacggaaacc 4440 
ccctttgtat cccccaatgg gtttcaagag 4500 
tccgaacctc tagttacctc caatggcatg 4560 
ctggacgagg ccggcaacct tacctcccaa 4620 
aaaaccaagt caaacataaa cctggaaata 4680 
ctaactgtgg ctgccgccgc acctctaatg 4740 
caggccccgc taaccgtgca cgactccaaa 4800 
gtgtcagaag gaaagctagc cctgcaaaca 4860 
acccttacta tcactgcctc accccctcta 4920 
ttgaaagagc ccatttatac acaaaatgga 4980 
catgtaacag acgacctaaa cactttgacc 5040 
aatacttcct tgcaaactaa agttactgga 5100 
caacttaatg tagcaggagg actaaggatt 5160 
gttagttatc cgtttgatgc tcaaaaccaa 5220 
tttataaact cagcccacaa cttggatatt 5280 
gcttcaaaca attccaaaaa gcttgaggtt 5340 
gaegctacag ccatagccat taatgcagga 5400 
ccaaacacaa atcccctcaa aacaaaaatt 5460 
gctatggttc ctaaactagg aactggcett 5520 
ggaaacaaaa ataatgataa gctaactttg 5.580 
agactaaatg cagagaaaga tgetaaaetc 5640 
atacttgeta cagtttcagt tttggctgtt 5700 
gttcaaagtg ctcatcttat tataagattt 5760 
ttcctggacc cagaatattg gaactttaga 5820 
acaaaegctg ttggatttat gcctaaccta 5880 
gecaaaagta acattgtcag tcaagtttac 5940 
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ttaaacggag acaaaactaa acctgtaaca 
acaggagaca caactccaag tgcatactct 
aactacatta atgaaatatt tgccacatcc 
taaagaagcg gccgctcgag catgcatcta 
gctagagctc gctgatcagc ctcgactgtg 
ccctcccccg tgccttcctt gaccctggaa 
aatgaggaaa ttgcatcgca ttgtctgagt 
gggcaggaca gcaaggggga ggattgggaa 
ggctctatgg cttctgaggc ggaaagaacc 
ccctgtagcg gcgcattaag cgcggcgggt 
cttgccagcg ccctagcgcc cgctcctttc 
gccggctttc cccgtcaagc tctaaatcgg 
ttacggcacc tcgaccccaa aaaacttgat 
ccctgataga cggtttttcg ccctttgacg 
ttgttccaaa ctggaacaac actcaaccct 
attttgggga tttcggccta ttggttaaaa 
aattaattct gtggaatgtg tgtcagttag 
ggcagaagta tgcaaagcat gcatctcaat 
ggctccccag caggcagaag tatgcaaagc 
ccgcccctaa ctccgcccat cccgccccta 
catggctgac taattttttt tatttatgca 
ttccagaagt agtgaggagg cttttttgga 
gcttgtatat ccattttcgg atctgatcaa 
gaacaagatg gattgcacgc aggttctccg 
gactgggcac aacagacaat cggctgctct 
gggcgcccgg ttctttttgt caagaccgac 
gaggcagcgc ggctatcgtg gctggccacg 
gttgtcactg aagcgggaag ggactggctg 
ctgtcatctc accttgctcc tgccgagaaa 
ctgcatacgc ttgatccggc tacctgccca 
cgagcacgta ctcggatgga agccggtctt 
caggggctcg cgccagccga actgttcgcc 
gatctcgtcg tgacccatgg cgatgcctgc 
ttttctggat tcatcgactg tggccggctg 
ttggctaccc gtgatattgc tgaagagctt 
ctttacggta tcgccgctcc cgattcgcag 
ttcttctgag cgggactctg gggttcgaaa 
cacgagattt cgattccacc gccgccttct 
gggacgccgg ctggatgatc ctccagcgcg 
ccaacttgtt tattgcagct tataatggtt 
caaataaagc atttttttca ctgcattcta 
cttatcatgt ctgtataccg tcgacctcta 
tgtttcctgt gtgaaattgt tatccgctca 
taaagtgtaa agcctggggt gcctaatgag 
cactgcccgc tttccagtcg ggaaacctgt 
gcgcggggag aggcggtttg cgtattgggc 
tgcgctcggt cgttcggctg cggcgagcgg 
tatccacaga. atcaggggat aacgcaggaa 
ccaggaaccg taaaaaggcc gcgttgctgg 
agcatcacaa aaatcgacgc tcaagtcaga 
accaggcgtt tccccctgga agctccctcg 
ccggatacct gtccgccttt ctcccttcgg 
gtaggtatct cagttcggtg taggtcgttc 
ccgttcagcc cgaccgctgc gccttatccg 
gacacgactt atcgccactg gcagcagcca 
taggcggtgc tacagagttc ttgaagtggt 
tatttggtat ctgcgctctg ctgaagccag 
gatccggcaa acaaaccacc gctggtagcg 
cgcgcagaaa aaaaggatct caagaagatc 
agtggaacga aaactcacgt taagggattt 
cctagatcct tttaaattaa aaatgaagtt 
cttggtctga cagttaccaa tgcttaatca 



ctaaccatta cactaaacgg tacacaggaa 6000 
atgtcatttt catgggactg gtctggccac 6060 
tcttacactt tttcatacat tgcccaagaa 6120 
gagggcccta ttctatagtg tcacctaaat 6180 
ccttctagtt gccagccatc tgttgtttgc 6240 
ggtgccactc ccactgtcct ttcctaataa 6300 
aggtgtcatt ctattctggg gggtggggtg 6360 
gacaatagca ggcatgctgg ggatgcggtg 6420 
agctggggct ctagggggta tccccacgcg 6480 
gtggtggtta cgcgcagcgt gaccgctaca 6540 
gctttcttcc cttcctttct cgccacgttc 6600 
ggcatccctt tagggttccg atttagtgct 6660 
tagggtgatg gttcacgtag tgggccatcg 6720 
ttggagtcca cgttctttaa tagtggactc 6780 
atctcggtct attcttttga tttataaggg 6840 
aatgagctga tttaacaaaa atttaacgcg 6900 
ggtgtggaaa gtccccaggc tccccaggca 6960 
tagtcagcaa ccaggtgtgg aaagtcccca 7020 
atgcatctca attagtcagc aaccatagtc 7080 
actccgccca gttccgccca ttctccgccc 7140 
gaggccgagg ccgcctctgc ctctgagcta 7200 
ggcctaggct tttgcaaaaa cgctcccggga 7260 
gagacaggat gaggatcgtty;£cgcatgatt 7320 
gccgcttggg. tggagaggct attcggctat 7380 
gatgccgccg -tgttccggct gtcagcgcag 7440 
ctgtccggtg -ccctgaatga actgcaggac 7500 
acgggcgttc . cttgcgcagc tgtgctcgac 7560 
ctattgggcg aagtgccggg gcaggatctc 7620 
gtatccatca tggctgatgc aatgcggcgg 7680 
ttcgaccacc aagcgaaaca tcgcatcgag 7740 
gtcgatcagg atgatctgga cgaagagcat 7800 
aggctcaagg cgcgcatgcc cgacggcgag 7860 
ttgccgaata tcatggtgga aaatggccgc 7920 
Sgtgtggcgg accgctatca ggacatagcg 7980 
ggcggcgaat gggctgaccg cttcctcgtg 8O40 
cgcatcgcct tctatcgcct tcttgacgag 8100 
tgaccgacca agcgacgccc aacctgccat 8160 
atgaaaggtt gggcttcgga atcgttttcc 8220 
gggatctcat gctggagttc ttcgcccacc 8280 
acaaataaag caatagcatc acaaatttca 8340 
gttgtggttt gtccaaactc atcaatgtat 8400 
gctagagctt ggcgtaatca tggtcatagc 8460 
caattccaca caacatacga gccggaagca .8520 
tgagctaact cacattaatt gcgttgcgct 8580 
cgtgccagct gcattaatga atcggccaac 8640 
gctcttccgc ttcctcgctc actgactcgc 8700 
tatcagctca ctcaaaggcg gtaatacggt 8760 
agaacatgtg agcaaaaggc cagcaaaagg 8820 
cgtttttcca taggctccgc ccccctgacg 8 880 
ggtggcgaaa cccgacagga ctataaagat 8940 
tgcgctctcc tgttccgacc ctgccgctta 9000 
gaagcgtggc gctttctcaa tgctcacgct 9060 
gctccaagct gggctgtgtg cacgaacccc 9120 
gtaactatcg tcttgagtcc aacccggtaa 9180 
ctggtaacag gattagcaga gcgaggtatg 9240 
ggcctaacta cggctacact agaaggacag 9300 
ttaccttcgg aaaaagagtt ggtagctctt 9360 
gtggtttttt tgtttgcaag cagcagatta 9420 
ctttgatctt ttctacgggg tctgacgctc 9480 
tggtcatgag attatcaaaa aggatcttca 9540 
ttaaatcaat ctaaagtata tatgagtaaa 9600 
gtgaggcacc tatctcagcg atctgtctat 9660 
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ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 9720 
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 9780 
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 9840 
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 9900 
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 9960 
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 10020 
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 10080 
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 10140 
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 10200 
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 10260 
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 10320 
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 10380 
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 10440 
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 10500 
gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 10560 
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc ' 10610 



<210> 17 . 
<211> 24 - 
<212> DNA 

<213> Artificial ^Sequence 

<220> v.:-,.: 

<223*., Description of Artificial Sequence: Primer 

/i? •":.<*- * ' 
<400> 17 ^ r. . \ 

tgtacaccgg atccggcgca cacc 

<210> 18 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

cacaacgagc tcaattaatt aattgccaca tcctc 35 

<210> 19 

<211> 4 

<212> PRT 

<213> adenovirus 

<400> 19 
Thr Leu Trp Thr 
1 

<210> 20 
<211> 12' 
<212> PRT 
<213> adenovirus 

<400> 20 

Pro Ser Ala Ser Ala Ser Ala Ser Ala Pro Gly Ser 
1 5 10 



24 



<210> 22 
<211> 327 
<212> DNA 
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<213> adenovirus 
<400> 22 

agatctgaat tcgagctcgc tgttgggctc gcggttgagg acaaactctt cgcggtcttt 60 
ccagtactct tggatcggaa acccgtcggc ctccgaacgg tactccgcca ccgagggacc 120 
tgagcgagtc cgcatcgacc ggatcggaaa acctctcgag aaaggcgtct aaccagtcac 180 
agtcgcaagg taggctgagc accgtggcgg gcggcagcgg gtggcggtcg gggttgtttc 240 
tggcggaggt gctgctgatg atgtaattaa agtaggcggt cttgagacgg cggatggtcg 300 
aggtgaggtg tggcaggctt gagatct 327 

<210> 23 
<211> 32480 
<212> DNA 
<213> adenovirus 

<400> 23 ^ 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60. 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
agtgaaatct gaataatttt gtgttactca tagcgcgtaa.tctctagcat cgatgtcgac 360 
aagcttgaat tcgattaatg tgagttagct cactcattag : gcaccccagg ctttacactt 420 
tatgcttccg gctcgtatgt tgtgtggaat ,tgtgagcgga taacaatttc acacaggaaa 480 
cagctatgac catgattacg aattcggcgCvagcaccatgg cctgaaataa cctctgaaag. 540 
aggaacttgg ttaggtacct tctgaggcgg aaagaaccag ctgtggaatg tgtgtcagtt 600 
agggtgtgga aagtccccag gctccccagc aggcagaagt atgcaaagca tgcatctcaa 660 
ttagtcagca accaggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag 720 
catgcatctc aattagtcag caaccatagt cccgccccta actccgccca tcccgcccct 780 
aactccgccc agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc 840 
agaggccgag gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg 900 
aggcctaggc ttttgcaaaa agcttgggat ctctataatc tcgcgcaacc tattttcccc 960 
tcgaacactt tttaagccgt agataaacag gctgggacac ttcacatgag- cgaaaaatac 1020 
atcgtcacct gggacatgtt gcagatccat gcacgtaaac tcgcaagccg actgatgcct 1080 
tctgaacaat ggaaaggcat tattgccgta agccgtggcg gtctggtacc ggtgggtgaa 1140 
gaccagaaac agcacctcga actgagccgc gatattgccc agcgtttcaa cgcgctgtat 1200 
ggcgagatcg atcccgtcgt tttacaacgt cgtgactggg aaaaccctgg cgttacccaa 1260 
cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga agaggcccgc 1320 
accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgctt tgcctggttt 1380 
. ccggcaccag aagcggtgcc ggaaagctgg ctggagtgcg atcttcctga ggccgatact 1440 
gtcgtcgtcc cctcaaactg gcagatgcac ggttacgatg cgcccatcta caccaacgta 1500 
acctatccca ttacggtcaa tccgccgttt gttcccacgg agaatccgac gggttgttac 1560 
tcgctcacat ttaatgttga tgaaagctgg ctacaggaag gccagacgcg aattattttt 1620 
gatggcgtta actcggcgtt tcatctgtgg tgcaacgggc gctgggtcgg ttacggccag 1680 
gacagtcgtt tgccgtctga atttgacctg agcgcatttt tacgcgccgg agaaaaccgc 1740 
ctcgcggtga tggtgctgcg ttggagtgac ggcagttatc tggaagatca ggatatgtgg 1800 
cggatgagcg gcattttccg tgacgtctcg ttgctgcata aaccgactac acaaatcagc 1860 
gatttccatg ttgccactcg ctttaatgat gatttcagcc gcgctgtact ggaggctgaa 1920 
gttcagatgt gcggcgagtt gcgtgactac ctacgggtaa cagtttcttt atggcagggt 1980 
gaaacgcagg tcgccagcgg caccgcgcct ttcggcggtg aaattatcga tgagcgtggt 2040 
ggttatgccg atcgcgtcac actacgtctg aacgtcgaaa acccgaaact gtggagcgcc 2100 
gaaatcccga atctctatcg tgcggtggtt gaactgcaca ccgccgacgg cacgctgatt 2160 
gaagcagaag cctgcgatgt cggtttccgc gaggtgcgga ttgaaaatgg tctgctgctg 2220 
ctgaacggca agccgttgct gattcgaggc gttaaccgtc acgagcatca tcctctgcat 2280 
ggtcaggtca tggatgagca gacgatggtg caggatatcc tgctgatgaa gcagaacaac 2340 
tttaacgccg tgcgctgttc gcattatccg aaccatccgc tgtggtacac gctgtgcgac 2400 
cgctacggcc tgtatgtggt. ggatgaagcc aatattgaaa cccacggcat ggtgccaatg 2460 
aatcgtctga ccgatgatcc gcgctggcta ccggcgatga gcgaacgcgt aacgcgaatg 2520 
gtgcagcgcg atcgtaatca cccgagtgtg atcatctggt cgctggggaa tgaatcaggc 2580 
cacggcgcta atcacgacgc gctgtatcgc tggatcaaat ctgtcgatcc ttcccgcccg 2640 
gtgcagtatg aaggcggcgg agccgacacc acggccaccg atattatttg cccgatgtac 2700 
gcgcgcgtgg atgaagacca gcccttcccg gctgtgccga aatggtccat caaaaaatgg 2760 
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ctttcgctac ctggagagac gcgcccgctg atcctttgcg aatacgccca cgcgatgggt 2820 
aacagtcttg gcggtttcgc taaatactgg caggcgtttc gtcagtatcc ccgtttacag 2880 
ggcggcttcg tctgggactg ggtggatcag tcgctgatta aatatgatga aaacggcaac 2940 
ccgtggtcgg cttacggcgg tgattttggc gatacgccga acgatcgcca gttctgtatg 3000 
aacggtctgg tctttgccga ccgcacgccg catccagcgc tgacggaagc aaaacaccag 3060 
cagcagtttt tccagttccg tttatccggg caaaccatcg aagtgaccag cgaatacctg 3120 
ttccgtcata gcgataacga gctcctgcac tggatggtgg cgctggatgg 1 taagccgctg 3180 
gcaagcggtg aagtgcctct ggatgtcgct ccacaaggta aacagttgat tgaactgcct 3240 
gaactaccgc agccggagag cgccgggcaa ctctggctca cagtacgcgt agtgcaaccg 3300 
aacgcgaccg catggtcaga agccgggcac atcagcgcct ggcagcagtg gcgtctggcg 3360 
gaaaacctca gtgtgacgct ccccgccgcg tcccacgcca tcccgcatct gaccaccagc 3420 
gaaatggatt tttgcatcga gctgggtaat aagcgttggc aatttaaccg ccagtcaggc 3480 
tttctttcac agatgtggat tggcgataaa aaacaactgc tgacgccgct gcgcgatcag 3540 
ttcacccgtg caccgctgga taacgacatt ggcgtaagtg aagcgacccg cattgaccct 3 600 
aacgcctggg tcgaacgctg gaaggcggcg ggccattacc aggccgaagc agcgttgttg 3 660 
cagtgcacgg cagatacact tgctgatgcg gtgctgatta cgaccgctca cgcgtggcag 3720 
catcagggga aaaccttatt tatcagccgg aaaacctacc ggattgatgg tagtggtcaa 3780 
atggcgatta ccgttgatgt tgaagtggcg agcgatacac cgcatccggc gcggattggc -3840 
ctgaactgcc agctggcgca ggtagcagag cgggtaaact ggctcggatt agggccgcaa ~3'900 
gaaaactatc ccgaccgcct tactgccgcc tgttttgacc gctgggatct gccattgtca 3960 
gacatgtata ccccgtacgt cttcccgagc gaaaacggtc tgcgctgcgg gacgcgcgaa 4O20 
ttgaattatg gcccacacca gtggcgcggc gacttccagt tcaacatcag ccgctacagt ^080: r > r; 
caacagcaac tgatggaaac cagccatcgc catctgctgc acgcggaaga aggcacatgg ?40;4Q * 
jr.ctgaatatcg acggtttcca tatggggatt ggtggcgacg actcctggag cccgtcagta^^O e 
; tcggcggaat t c cage t gag cgccggtcgc taccattacc agttggtctg gtgtcaaaaa^'i^O 
-taataataac cgggcaggcc atgtctgccc gtatttcgcg taaggaaatc cattatgtac^^O 
tatttaaaaa. acacaaactt ttggatgttc ggtttattct ttttctttta ctttttta'tc "4380 
atgggagcct acttcccgtt tttcccgatt tggctacatg acatcaacca tatcagcaaa 4440 
agtgatacgg gtattatttt tgeegctatt tctctgttct cgctattatt ccaaccgctg 4500 
tttggtctgc tttctgacaa acteggaact tgtttattgc agcttataat ggttacaaat 4560 
aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgeat tctagttgtg 4620 
gtttgtccaa actcatcaat gtatcttatc atgtctggat ccagatctgg gcgtggctta 4680 
agggtgggaa agaatatata aggtgggggt cttatgtagt tttgtatctg ttttgeagea 4740 
gccgccgccg ccatgagcac caactcgttt gatggaagca ttgtgagctc atatttgaca 4800 
acgcgcatgc ccccatgggc cggggtgcgt cagaatgtga tgggctccag cattgatggt 4860 
cgccccgtcc tgcccgcaaa ctctactacc ttgacctacg agaccgtgtc tggaacgccg 4920 
ttggagactg cagcctccgc cgccgcttca gccgctgcag ccaccgcccg cgggattgtg 4980 
actgactttg ctttcctgag cccgcttgca agcagtgcag cttcccgttc atccgcccgc 5040 
gatgacaagt tgaeggctet tttggcacaa ttggattctt tgacccggga acttaatgtc 5100 
gtttctcagc agctgttgga tctgcgccag caggtttctg ccctgaaggc ttcctcccct 5160 
cccaatgcgg tttaaaacat aaataaaaaa ccagactctg tttggatttg gatcaagcaa 5220 
gtgtcttgct gtctttattt aggggttttg cgcgcgcggt aggeceggga ccagcggtct 5280 
cggtcgttga gggtcctgtg tattttttcc aggacgtggt aaaggtgact ctggatgttc 5340 
agatacatgg gcataagccc gtctctgggg tggaggtagc accactgcag agettcatge 5400 
tgcggggtgg tgttgtagat gatccagtcg tagcaggagc gctgggcgtg gtgcctaaaa 5460 
atgtctttca gtagcaagct gattgecagg ggcaggccct tggtgtaagt gtttacaaag 5520 
eggttaaget gggatgggtg catacgtggg gatatgagat gcatcttgga ctgtattttt 5580 
aggttggcta tgttcccagc catatccctc eggggattea tgttgtgcag aaccaccagc 5640 
acagtgtatc eggtgeaett gggaaatttg tcatgtagct tagaaggaaa tgcgtggaag 5700 
aacttggaga cgcccttgtg acctccaaga ttttccatgc attegtccat aatgatggca 5760 
atgggcccac gggeggegge ctgggcgaag atatttctgg gatcactaac gtcatagttg 5820 
tgttccagga tgagatcgtc ataggecatt tttacaaagc gegggeggag ggtgccagac 5880 
tgcggtataa tggttccatc cggcccaggg gcgtagttac cctcacagat ttgeatttec 5940 
caegctttga gttcagatgg ggggatcatg tctacctgcg gggcgatgaa gaaaacggtt 6000 
teeggggtag gggagatcag ctgggaagaa agcaggttcc tgagcagctg cgacttaccg 6060 
cagccggtgg gecegtaaat cacacctatt acegggtgea actggtagtt aagagagctg 6120 
cagctgccgt catccctgag caggggggee acttcgttaa gcatgtccct gaetegcatg 6180 
ttttccctga ccaaatccgc cagaaggege tcgccgccca gcgatagcag ttcttgcaag 6240 
gaagcaaagt ttttcaaegg tttgagaccg tccgccgtag geatgetttt gagcgtttga 6300 
ccaagcagtt ccaggcggtc ccacagctcg gtcacctgct ctacggcatc tcgatccagc 6360 
atatctcctc gtttcgcggg ttggggcggc tttcgctgta eggcagtagt cggtgctcgt 6420 
ccagacgggc cagggtcatg tctttccacg ggcgcagggt cctcgtcagc gtagtctggg 6480 
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tcacggtgaa ggggtgcgct ccgggctgcg cgctggccag ggtgcgcttg aggctggtcc 654 0 
tgctggtgct gaagcgctgc cggtcttcgc cctgcgcgtc ggccaggtag catttgacca 6600 
tggtgtcata gtccagcccc tccgcggcgt ggcccttggc gcgcagcttg cccttggagg 6660 
aggcgccgca cgaggggcag tgcagacttt tgagggcgta gagcttgggc gcgagaaata 6720 
ccgattccgg ggagtaggca tccgcgccgc aggccccgca gacggtctcg cattccacga 6780 
gccaggtgag ctctggccgt tcggggtcaa aaaccaggtt tcccccatgc tttttgatgc 6840 
gtttcttacc tctggtttcc atgagccggt gtccacgctc ggtgacgaaa aggctgtccg 6900 
tgtccccgta tacagacttg agaggcctgt cctcgagcgg tgttccgcgg tcctcctcgt 6960 
atagaaactc ggaccactct gagacaaagg ctcgcgtcca ggccagcacg aaggaggcta 7020 
agtgggaggg gtagcggtcg ttgtccacta gggggtccac tcgctccagg gtgtgaagac 7080 
acatgtcgcc ctcttcggca tcaaggaagg tgattggttt gtaggtgtag gccacgtgac 7140 
cgggtgttcc tgaagggggg ctataaaagg gggtgggggc gcgttcgtcc tcactctctt 7200 
ccgcatcgct gtctgcgagg gccagctgtt ggggtgagta ctccctctga aaagcgggca 7260 
tgacttctgc gctaagattg tcagtttcca aaaacgagga ggatttgata ttcacctggc 7320 
ccgcggtgat gcctttgagg gtggccgcat ccatctggtc agaaaagaca atctttttgt 7380 
tgtcaagctt ggtggcaaac gacccgtaga gggqgttgga cagcaacttg gcgatggagc 7440 
gcagggtttg gtttttgtcg cgatcggcgc gctccttggc cgcgatgttt agctgcacgt 7500 
attcgcgcgc aacgcaccgc cattcgggaa agacggtggt gcgctcgtcg ggcaccaggt 7560 
gcacgcgcca accgcggttg tgcagggtga caaggtcaac gctggtggct acctctccgc 7620 
gtaggcgctc gttggtccag cagaggcggc cgcccttgcg cgagcagaat ggcggtaggg 7680 
ggtctagctg cgtctcgtcc ggggggtctg cgtccacggt aaagaccccg ggcagcaggc 7740 
gcgcgtcgaa gtagtctatc ttgcatcctt. -gcaagtctag cgcctgctgc catgcgcggg 7800 
cggcaagcgc gcgctcgtat gggttgagtg;. ggggacccca tggcatgggg tgggtgagcg 7860 
cggaggcgta catgccgcaa, atgtcgtaaa cgtagagggg ctctctgagt attccaagat 7920 
atgtagggta gcatcttcca^ ccgcggatgc.: tggcgcgcac gtaatcgtat agttcgtgcg 7980 
agggagcgag gaggtcggga:>cGgaggfct'gc tacgggcggg ctgctctgct cggaagacta 8040 
tctgcctgaa gatggcatgt 'gagttggatg atatggttgg acgctggaag acgttgaagc 8100 
tggcgtctgt gagacctacc gcgtcacgca cgaaggaggc gtaggagtcg cgcagcttgt 8160 
tgaccagctc ggcggtgacc tgcacgtcta gggcgcagta gtccagggtt tccttgatga 8220 
tgtcatactt atcctgtccc ttttttttcc acagctcgcg gttgaggaca aactcttcgc 8280 
ggtctttcca gtactcttgg atcggaaacc cgtcggcctc cgaacggtaa gagcctagca 8340 . 
tgtagaactg gttgacggcc tggtaggcgc agcatccctt ttctacgggt agcgcgtatg 8400 
cctgcgcggc cttccggagc gaggtgtggg tgagcgcaaa ggtgtccctg accatgactt 8460 
tgaggtactg gtatttgaag tcagtgtcgt cgcatccgcc ctgctcccag agcaaaaagt 8520 
ccgtgcgctt tttggaacgc ggatttggca gggcgaaggt gacatcgttg aagagtatct 8580 
ttcccgcgcg aggcataaag ttgcgtgtga tgcggaaggg tcccggcacc tcggaacggt 8640 
tgttaattac ctgggcggcg agcacgatct cgtcaaagcc gttgatgttg tggcccacaa 8700 
tgtaaagttc caagaagcgc gggatgccct tgatggaagg caatttttta agttcctcgt 8760 
aggtgagctc ttcaggggag ctgagcccgt gctctgaaag ggcccagtct gcaagatgag 8820 
ggttggaagc gacgaatgag ctccacaggt cacgggccat tagcatttgc aggtggtcgc 8880 
gaaaggtcct aaactggcga cctatggcca ttttttctgg ggtgatgcag tagaaggtaa 8940 
gcgggtcttg ttcccagcgg tcccatccaa ggttcgcggc taggtctcgc gcggcagtca 9000 
ctagaggctc atctccgccg aacttcatga ccagcatgaa gggcacgagc tgcttcccaa 9060 
aggcccccat ccaagtatag gtctctacat cgtaggtgac aaagagacgc tcggtgcgag 9120 
gatgcgagcc gatcgggaag aactggatct cccgccacca attggaggag tggctattga 9180 
tgtggtgaaa gtagaagtcc ctgcgacggg ccgaacactc gtgctggctt ttgtaaaaac 9240 
gtgcgcagta ctggcagcgg tgcacgggct gtacatcctg cacgaggttg acctgacgac 9300 
cgcgcacaag gaagcagagt gggaatttga gcccctcgcc tggcgggttt ggctggtggt 9360 
cttctacttc ggctgcttgt ccttgaccgt ctggctgctc gaggggagtt acggtggatc 9420 
ggaccaccac gccgcgcgag cccaaagtcc agatgtccgc gcgcggcggt cggagcttga 9480 
tgacaacatc gcgcagatgg gagctgtcca tggtctggag ctcccgcggc gtcaggtcag 9540 
gcgggagctc ctgcaggttt acctcgcata gacgggtcag ggcgcgggct agatccaggt 9600 
gatacctaat ttccaggggc tggttggtgg cggcgtcgat ggcttgcaag aggccgcatc 9660 
cccgcggcgc gactacggta ccgcgcggcg ggcggtgggc cgcgggggtg tccttggatg 9720 
atgcatctaa aagcggtgac gcgggcgagc ccccggaggt agggggggct ccggacccgc 9780 
c 99gagaggg ggcaggggca cgtcggcgcc gcgcgcgggc aggagctggt gctgcgcgcg 9840 
taggttgctg gcgaacgcga cgacgcggcg gttgatctcc tgaatctggc gcctctgcgt 9900 
gaagacgacg ggcccggtga gcttgagcct gaaagagagt tcgacagaat caatttcggt 9960 
gtcgttgacg gcggcctggc gcaaaatctc ctgcacgtct cctgagttgt cttgataggc 10020 
gatctcggcc atgaactgct cgatctcttc ctcctggaga tctccgcgtc cggctcgctc 10080 
cacggtggcg gcgaggtcgt tggaaatgcg ggccatgagc tgcgagaagg cgttgaggcc 10140 
tccctcgttc cagacgcggc tgtagaccac gcccccttcg gcatcgcggg cgcgcatgac 10200 
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cacctgcgcg agattgagct ccacgtgccg ggcgaagacg gcgtagtttc gcaggcgctg 10260 
aaagaggtag ttgagggtgg tggcggtgtg ttctgccacg aagaagtaca taacccagcg 10320 
tcgcaacgtg gattcgttga tatcccccaa ggcctcaagg cgctccatgg cctcgtagaa 10380 
gtccacggcg aagttgaaaa actgggagtt gcgcgccgac acggttaact cctcctccag 10440 
aagacggatg agctcggcga cagtgtcgcg cacctcgcgc tcaaaggcta caggggcctc 10500 
ttcttcttct tcaatctcct cttccataag ggcctcccct tcttcttctt ctggcggcgg 10560 
tgggggaggg gggacacggc ggcgacgacg gcgcaccggg aggcggtcga caaagcgctc 10620 
gatcatctcc ccgcggcgac ggcgcatggt ctcggtgacg gcgcggccgt tctcgcgggg 10680 
gcgcagttgg aagacgccgc ccgtcatgtc ccggttatgg gttggcgggg ggctgccatg 10740 
cggcagggat acggcgctaa cgatgcatct caacaattgt tgtgtaggta ctccgccgcc 10800 
gagggacctg agcgagtccg catcgaccgg atcggaaaac ctctcgagaa aggcgtctaa 10860 
ccagtcacag tcgcaaggta ggctgagcac cgtggcgggc ggcagcgggc ggcggtcggg 10920 
gttgtttctg gcggaggtgc tgctgatgat gtaattaaag taggcggtct tgagacggcg 10980 
gatggtcgac agaagcacca tgtccttggg tccggcctgc tgaatgcgca ggcggtcggc 11040 
catgccccag gcttcgtttt gacatcggcg caggtctttg tagtagtctt gcatgagcct 11100 
ttctaccggc acttcttctt ctccttcctc ttgtectgca tctcttgcat ctatcgctgc 11160 
ggcggcggcg gagtttggcc gtaggtggcg ccctcttcct cccatgcgtg tgaccccgaa 11220 
gcccctcatc ggctgaagca gggctaggtc ggcgacaacg cgctcggcta atatggcctg 11280 
ctgcacctgc gtgagggtag actggaagtc atccatgtcc acaaagcggt ggtatgcgcc 11340 
cgtgttgatg gtgtaagtgc agttggccat aacggaccag ttaacggtct ggtgacccgg 11400 
ctgcgagagc tcggtgtacc tgagacgcga gtaagccctc gagtcaaata cgtagtcgtt 11460 
■ gcaagtccgc accaggtact ggtatcccac caaaaagtgc ggcggcggct ggcggtagag 11520 
v -h$ gggccagcgt agggtggccg gggctccggg ggcgagatct tccaacataa ggcgatgata. 11580 
tccgtagatg tacctggaca tccaggtgat gccggcggcg gtggtggagg ^cgcgcggaaa 11640 
gtcgcggacg cggttccaga tgttgcgcag cggcaaaaag tgctccatgg^tcgggacgct,: 11700 
ctggccggtc aggcgcgcgc aatcgttgac gctctagacc gtgcaaaagg Vagagcctgta* 11760 
agcgggcact cttccgtggt ctggtggata aattcgcaag ggtatcatgg cggacgaccg 11820 
gggttcgagc cccgtatccg gccgtccgcc gtgatccatg cggttaccgc ccgcgtgtcg 11880 
aacccaggtg tgcgacgtca gacaacgggg gagtgctcct tttggcttcc ttccaggcgc 11940 
ggcggctgct gcgctagctt ttttggccac tggccgcgcg cagcgtaagc ggttaggctg 12000 
gaaagcgaaa gcattaagtg gctcgctccc tgtagccgga gggttatttt ccaagggttg 12060 
agtcgcggga cccccggttc gagtctcgga ccggccggac tgcggcgaac gggggtttgc 12120 
ctccccgtca tgcaagaccc cgcttgcaaa ttcctccgga aacagggacg agcccctttt 12180 
ttgcttttcc cagatgcatc cggtgctgcg gcagatgcgc ccccctcctc agcagcggca 12240 
agagcaagag cagcggcaga catgcagggc accctcccct cctcctaccg cgtcaggagg 12300 
ggcgacatcc gcggttgacg cggcagcaga tggtgattac gaacccccgc ggcgccgggc 12360 
ccggcactac ctggacttgg aggagggcga gggcctggcg cggctaggag cgccctctcc 12420 
tgagcggtac ccaagggtgc agctgaagcg tgatacgcgt gaggcgtacg tgccgcggca 12480 
gaacctgttt cgcgaccgcg agggagagga gcccgaggag atgcgggatc gaaagttcca 12540 
cgcagggcgc gagctgcggc atggcctgaa tcgcgagcgg ttgctgcgcg aggaggactt 12600 
tgagcccgac gcgcgaaccg ggattagtcc cgcgcgcgca cacgtggcgg ccgccgacct 12660 
ggtaaccgca tacgagcaga cggtgaacca ggagattaac tttcaaaaaa gctttaacaa 12720 
ccacgtgcgt acgcttgtgg cgcgcgagga ggtggctata ggactgatgc atctgtggga 12780 
ctttgtaagc gcgctggagc aaaacccaaa tagcaagccg ctcatggcgc agctgttcct 12840 
tatagtgcag cacagcaggg acaacgaggc attcagggat gcgctgctaa acatagtaga 12900 
gcccgagggc cgctggctgc tcgatttgat aaacatcctg cagagcatag tggtgcagga 12960 
gcgcagcttg agcctggctg acaaggtggc cgccatcaac tattccatgc ttagcctggg 13020 
caagttttac gcccgcaaga tataccatac cccttacgtt cccatagaca aggaggtaaa 13080 
gatcgagggg ttctacatgc gcatggcgct gaaggtgctt accttgagcg acgacctggg 13140 
cgtttatcgc aacgagcgca tccacaaggc cgtgagcgtg agccggcggc gcgagctcag 13200 
cgaccgcgag ctgatgcaca gcctgcaaag ggccctggct ggcacgggca gcggcgatag 13260 
agaggccgag tcctactttg acgcgggcgc tgacctgcgc tgggccccaa gccgacgcgc 13320 
cctggaggca gctggggccg gacctgggct ggcggtggca cccgcgcgcg ctggcaacgt 13380 
cggcggcgtg gaggaatatg acgaggacga tgagtacgag ccagaggacg gcgagtacta 13440 
agcggtgatg tttctgatca gatgatgcaa gacgcaacgg acccggcggt gcgggcggcg 13500 
ctgcagagcc agccgtccgg ccttaactcc acggacgact ggcgccaggt catggaccgc 13560 
atcatgtcgc tgactgcgcg caatcctgac gcgttccggc agcagccgca ggccaaccgg 13620 
ctctccgcaa ttctggaagc ggtggtcccg gcgcgcgcaa accccacgca cgagaaggtg 13680 
ctggcgatcg taaacgcgct ggccgaaaac agggccatcc ggcccgacga ggccggcctg 13740 
gtctacgacg cgctgcttca gcgcgtggct cgttacaaca gcggcaacgt gcagaccaac 13800 
ctggaccggc tggtggggga tgtgcgcgag gccgtggcgc agcgtgagcg cgcgcagcag 13860 
ca 99gcaacc tgggctccat ggttgcacta aacgccttcc tgagtacaca gcccgccaac 13920 
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gtgccgcggg gacaggagga ctacaccaac tttgtgagcg cactgcggct aatggtgact 13980 
gagacaccgc aaagtgaggt gtaccagtct gggccagact attttttcca gaccagtaga 14040 
caaggcctgc agaccgtaaa cctgagccag gctttcaaaa acttgcaggg gctgtggggg 14100 
gtgcgggctc ccacaggcga ccgcgcgacc gtgtctagct tgctgacgcc caactcgcgc 14160 
ctgttgctgc tgctaatagc gcccttcacg gacagtggca gcgtgtcccg ggacacatac 14220 
ctaggtcact tgctgacact gtaccgcgag gccataggtc aggcgcatgt ggacgagcat 14280 
actttccagg agattacaag tgtcagccgc gcgctggggc aggaggacac gggcagcctg 14340 
gaggcaaccc taaactacct gctgaccaac cggcggcaga agatcccctc gttgcacagt 14400 
ttaaacagcg aggaggagcg cattttgcgc tacgtgcagc agagcgtgag ccttaacctg 14460 
atgcgcgacg gggtaacgcc cagcgtggcg ctggacatga ccgcgcgcaa catggaaccg 14520 
ggcatgtatg cctcaaaccg gccgtttatc aaccgcctaa tggactactt gcatcgcgcg 14580 
gccgccgtga accccgagta tttcaccaat gccatcttga acccgcactg gctaccgccc 14640 
cctggtttct acaccggggg attcgaggtg cccgagggta acgatggatt cctctgggac 14700 
gacatagacg acagcgtgtt ttccccgcaa ccgcagaccc tgctagagtt gcaacagcgc 14760 
gagcaggcag aggcggcgct gcgaaaggaa agcttccgca ggc'caagcag cttgtccgat 14820 
ctaggcgctg cggccccgcg gtcagatgct agtagcccat ttccaagctt gatagggtct 14880 
cttaccagca ctcgcaccac ccgcccgcgc ctgctgggcg aggaggagta cctaaacaac 14940 
tcgctgctgc agccgcagcg cgaaaaaaac ctgcctccgg catttcccaa caacgggata 15000 
gagagcctag tggacaagat gagtagatgg aagacgtacg cgcaggagca cagggacgtg 15060 
ccaggcccgc gcccgcccac ccgtcgtcaa aggcacgacc gtcagcgggg tctggtgtgg 15120 
gaggacgatg actcggcaga cgacagcagc gtcctggatt tgggagggag tggcaacccg. 15180 
tttgcgcacc ttcgccccag, -gctggggaga atgttttaaa aaaaaaaaag catgatgcaa 15240 
aataaaaaac tcaccaaggc ,catggcaccg agcgttggtt ttcttgtatt ccccttagta 15300 
tgcggcgcgc^ggcgatatat gaggaaggtc ctcctccctc ctacgagagt gtggtgagcg 15360 
cggcgccagt, ggcggcggpg ctgggttcjtc ccttcgatgc tcccctggac ccgccgtttg 15420 • 
tgcctccgcg ; . : gfc : abctgcgg cctaccgggg ggagaaacag catccgttac tctgagttgg 15480 
cacccctatt .cgacaceacc cgtgtgtacc tggtggacaa caagtcaacg gatgtggcat 15540 
ccctgaacta ccagaacgac cacagcaact ttctgaccac ggtcattcaa aacaatgact 15600 
acagcccggg ggaggcaagc acacagacca tcaatcttga cgaccggtcg cactggggcg 15660 
gcgacctgaa aaccatcctg cataccaaca tgccaaatgt gaacgagttc atgtttacca 15720 
ataagtttaa ggcgcgggtg atggtgtcgc gcttgcctac taaggacaat caggtggagc 15780 
tgaaatacga gtgggtggag ttcacgctgc ccgagggcaa ctactccgag accatgacca 15840 
tagaccttat gaacaacgcg atcgtggagc actacttgaa agtgggcaga cagaacgggg 15900 
ttctggaaag cgacatcggg gtaaagtttg acacccgcaa cttcagactg gggtttgacc 15960 
ccgtcactgg tcttgtcatg cctggggtat atacaaacga agccttccat ccagacatca 16020 
ttttgctgcc aggatgcggg gtggacttca cccacagccg cctgagcaac ttgttgggca 16080 
tccgcaagcg gcaacccttc caggagggct ttaggatcac ctacgatgat ctggagggtg 16140 
gtaacattcc cgcactgttg gatgtggacg cctaccaggc gagcttgaaa gatgacaccg 16200 
aacagggcgg gggtggcgca ggcggcagca acagcagtgg cagcggcgcg gaagagaact 16260 
ccaacgcggc agccgcggca atgcagccgg tggaggacat gaacgatcat gccattcgcg 16320 
gcgacacctt tgccacacgg gctgaggaga agcgcgctga ggccgaagca gcggccgaag 16380 
ctgccgcccc cgctgcgcaa cccgaggtcg agraagcctca gaagaaaccg gtgatcaaac 16440 
ccctgacaga ggacagcaag aaacgcagtt acaacctaat aagcaatgac agcaccttca 16500 
cccagtaccg cagctggtac cttgcataca actacggcga ccctcagacc ggaatccgct 16560 
catggaccct gctttgcact cctgacgtaa cctgcggctc ggagcaggtc tactggtcgt 16620 
tgccagacat gatgcaagac cccgtgacct tccgctccac gcgccagatc agcaactttc 16680 
cggtggtggg cgccgagctg ttgcccgtgc actccaagag cttctacaac gaccaggccg 16740 
tctactccca actcatccgc cagtttacct ctctgaccca cgtgttcaat cgctttcccg 16800 
agaaccagat tttggcgcgc ccgccagccc ccaccatcac caccgtcagt gaaaacgttc 16860 
ctgctctcac agatcacggg acgctaccgc tgcgcaacag catcggagga gtccagcgag 16920 
tgaccattac tgacgccaga cgccgcacct gcccctacgt ttacaaggcc ctgggcatag 16980 
tctcgccgcg cgtcctatcg agccgcactt tttgagcaag catgtccatc cttatatcgc 17040 
ccagcaataa cacaggctgg ggcctgcgct tcccaagcaa gatgtttggc ggggccaaga 17100 
agcgctccga ccaacaccca gtgcgcgtgc gcgggcacta ccgcgcgccc tggggcgcgc 17160 
acaaacgcgg ccgcactggg cgcaccaccg tcgatgacgc catcgacgcg gtggtggagg 17220 
aggcgcgcaa ctacacgccc acgccgccac cagtgtccac agtggacgcg gccattcaga 17280 
ccgtggtgcg cggagcccgg cgctatgcta aaatgaagag acggcggagg cgcgtagcac 17340 
gtcgccaccg ccgccgaccc ggcactgccg cccaacgcgc ggcggcggcc ctgcttaacc 17400 
gcgcacgtcg caccggccga cgggcggcca tgcgggccgc tcgaaggctg gccgcgggta 17460 
ttgtcactgt gccccccagg tccaggcgac gagcggccgc cgcagcagcc gcggccatta 17520 
gtgctatgac tcagggtcgc aggggcaacg tgtattgggt gcgcgactcg gttagcggcc 17580 
tgcgcgtgcc cgtgcgcacc cgccccccgc gcaactagat tgcaagaaaa aac tact tag 17640 
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actcgtactg ttgtatgtat ccagcggcgg 
aaatcaaaga agagatgctc caggtcatcg 
aagagcagga ttacaagccc cgaaagctaa 
atgatgaact tgacgacgag gtggaactgc 
agtggaaagg tcgacgcgta aaacgtgttt 
ccggtgagcg ctccacccgc acctacaagc 
acctgcttga gcaggccaac gagcgcctcg 
acatgctggc gttgccgctg gacgagggca 
tgcagcaggt gctgcccgcg cttgcaccgt 
ctggtgactt ggcacccacc gtgcagctga 
tcttggaaaa aatgaccgtg gaacctgggc 
agcaggtggc gccgggactg ggcgtgcaga 
gcaccagtat tgccaccgcc acagagggca 
cggtggcgga tgccgcggtg caggcggtcg 
tgcaaacgga cccgtggatg tttcgcgttt 
agtacggcgc cgccagcgcg ctactgcccg 
cccccggcta tcgtggctac acctaccgcc 
ccaccactgg aacccgccgc cgccgtcgcc 
tgcgcagggt ggctcgcgaa ggaggcagga 
ccagcatcgt ttaaaagccg gtctttgtgg 
tccgtttccc ggtgccggga ttccgaggaa 
acggcctgac gggcggcatg cgtcgtgcgc 
gcatgcgcgg cggtatcctg cccctcctta 
. tgcccggaat tgcatccgtg gccttgcagg 
tgtggaaaaa tcaaaataaa aagtctggac 
tgtagaatgg aagacatcaa ctttgcgtct 
atgggaaact ggcaagatat cggcaccagc 
tcgctgtgga gcggcattaa aaatttcggt 
tggaacagca gcacaggcca gatgctgagg 
aaggtggtag atggcctggc ctctggcatt 
gtgcaaaata agattaacag taagcttgat 
gccgtggaga cagtgtctcc agaggggcgt 
gaaactctgg tgacgcaaat agacgagcct 
ctgcccacca cccgtcccat cgcgcccatg 
gtaacgctgg acctgcctcc ccccgccgac 
accgccgttg ttgtaacccg tcctagccgc 
cgatcgttgc ggcccgtagc cagtggcaac 
ctgggggtgc aatccctgaa gcgccgacga 
tgtcatgtat gcgtccatgt cgccgccaga 
ccaagatggc taccccttcg atgatgccgc- 
acgcctcgga gtacctgagc cccgggctgg 
tcagcctgaa taacaagttt agaaacccca 
accggtccca gcgtttgacg ctgcggttca 
cgtacaaggc gcggttcacc ctagctgtgg 
. cgtactttga catccgcggc gtgctggaca 
ctgcctacaa cgccctggct cccaagggtg 
ctactgctct tgaaataaac ctagaagaag 
agcaagctga gcagcaaaaa actcacgtat 
ttacaaagga gggtattcaa ataggtgtcg 
catttcaacc tgaacctcaa ataggagaat 
cagctgggag agtccttaaa aagactaccc 
aacccacaaa tgaaaatgga gggcaaggca 
aaagtcaagt ggaaatgcaa tttttctcaa 
acttgactcc taaagtggta -ttgtacagtg 
atatttctta catgcccact attaaggaag 
ctatgcccaa caggcctaat tacattgctt 
acaacagcac gggtaatatg ggtgttctgg 
tagatttgca agacagaaac acagagcttt 
atagaaccag gtacttttct atgtggaatc 
gaattattga aaatcatgga actgaagatg 
gtgtgattaa tacagagact cttaccaagg 
gggaaaaaga tgctacagaa ttttcagata 



cggcgcgcaa cgaagctatg tccaagcgca 17700 
cgccggagat ctatggcccc ccgaagaagg 17760 
agcgggtcaa aaagaaaaag aaagatgatg 17820 
tgcacgctac cgcgcccagg cgacgggtac 17880 
tgcgacccgg caccaccgta gtctttacgc 17940 
gcgtgtatga tgaggtgtac ggcgacgagg 18000 
gggagtttgc ctacggaaag cggcataagg 18060 
acccaacacc tagcctaaag cccgtaacac 18120 
ccgaagaaaa gcgcggccta aagcgcgagt 18180 
tggtacccaa gcgccagcga ctggaagatg 18240 
tggagcccga ggtccgcgtg cggccaatca 18300 
ccgtggacgt tcagataccc actaccagta 18360 
tggagacaca aacgtccccg gttgcctcag 18420 
ctgcggccgc gtccaagacc tctacggagg 18480 
cagccccccg gcgcccgcgc ggttcgagga 18540 
aatatgccct acatccttcc attgcgccta 18600 
ccagaagacg agcaactacc cgacgccgaa 18660 
gtcgccagcc cgtgctggcc ccgatttccg 18720 
ccctggtgct gccaacagcg cgctaccacc 18780 
ttcttgcaga tatggccctc acctgccgcc 18840 
gaatgcaccg taggaggggc atggccggcc 18900 
accaccggcg gcggcgcgcg tcgcaccgtc 18960 
ttccactgat cgccgcggcg attggcgccg 19020 
cgcagagaca 'cfcgattaaaa acaagttgca 19080 
tctcacgctc.^gc.ttggtcct ■ gtaactattt 19140 
ctggccccgc /gacacggctc gcgcccgttc 19200 
aatatgagcg gtggcgcctt cagctggggc 19260 
tccaccgtta agaactatgg cagcaaggcc 19320 
gataagttga aagagcaaaa tttccaacaa 19380 
agcggggtgg tggacctggc caaccaggca 19440 
ccccgccctc ccgtagagga gcctccaccg 19500 
ggcgaaaagc gtccgcgccc cgacagggaa 19560 
ccctcgtacg aggaggcact aaagcaaggc 19620 
gctaccggag tgctgggcca gcacacaccc 19680 
acccagcaga aacctgtgct gccaggcccg 19740 
gcgtccctgc gccgcgccgc cagcggtccg 19800 
tggcaaagca cactgaacag catcgtgggt 19860 
tgcttctgaa tagctaacgt gtcgtatgtg 19920 
ggagctgctg agccgccgcg cgcccgcttt 19980 
agtggtctta catgcacatc tcgggccagg 20040 
tgcagtttgc ccgcgccacc gagacgtact 20100 
cggtggcgcc tacgcacgac gtgaccacag 20160 
tccctgtgga ccgtgaggat actgcgtact 20220 
gtgataaccg tgtgctggac atggcttcca 20280 
ggggccctac ttttaagccc tactctggca 20340 
ccccaaatcc ttgcgaatgg gatgaagctg 20400 
aggacgatga caacgaagac gaagtagacg 20460 
ttgggcaggc gccttattct ggtataaata 20520 
aaggtcaaac acctaaatat gccgataaaa 20580 
ctcagtggta cgaaactgaa attaatcatg 20640 
caatgaaacc atgttacggt tcatatgcaa 20700 
ttcttgtaaa gcaacaaaat ggaaagctag 20760 
ctactgaggc gaccgcaggc aatggtgata 20820 
aagatgtaga tatagaaacc ccagacactc 20880 
gtaactcacg agaactaatg ggccaacaat 20940 
ttagggacaa ttttattggt ctaatgtatt 21000 
cgggccaagc atcgcagttg aatgctgttg 21060 
cataccagct tttgcttgat tccattggtg 21120 
aggctgttga cagctatgat ccagatgtta 21180 
aacttccaaa ttactgcttt ccactgggag 21240 
taaaacctaa aacaggtcag gaaaatggat 21300 
aaaatgaaat aagagttgga aataattttg 21360 
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ccatggaaat caatctaaat gccaacctgt ggagaaattt cctgtactcc aacatagcgc 21420 
tgtatttgcc cgacaagcta aagtacagtc cttccaacgt aaaaatttct gataacccaa 21480 
acacctacga ctacatgaac aagcgagtgg tggctcccgg gttagtggac tgctacatta 21540 
accttggagc acgctggtcc cttgactata tggacaacgt caacccattt aaccaccacc 21600 
gcaatgctgg cctgcgctac cgctcaatgt tgctgggcaa tggtcgctat gtgcccttcc 21660 
acatccaggt gcctcagaag ttctttgcca ttaaaaacct ccttctcctg ccgggctcat 21720 
acacctacga gtggaacttc aggaaggatg ttaacatggt tctgcagagc tccctaggaa 21780 
atgacctaag ggttgacgga gccagcatta agtttgatag catttgcctt tacgccacct 21840 
tcttccccat ggcccacaac accgcctcca cgcttgaggc catgcttaga aacgacacca 21900 
acgaccagtc ctttaacgac tatctctccg ccgccaacat gctctaccct atacccgcca 21960 
acgctaccaa cgtgcccata tccatcccct cccgcaactg ggcggctttc cgcggctggg 22020 
ccttcacgcg ccttaagact aaggaaaccc catcactggg ctcgggctac gacccttatt 22080 
acacctactc tggctctata ccctacctag atggaacctt ttacctcaac cacaccttta 22140 
agaaggtggc cattaccttt gactcttctg tcagctggcc tggcaatgac cgcctgctta 22200 
cccccaacga gtttgaaatt aagcgctcag ttgacgggga gggttacaac gttgcccagt 22260 
gtaacatgac caaagactgg ttcctggtac aaatgctagc taactacaac attggctacc 22320 
agggcttcta tatcccagag agctacaagg accgcatgta ctccttcttt agaaacttcc 22380 
agcccatgag ccgtcaggtg gtggatgata ctaaatacaa ggactaccaa caggtgggca 22440 
tcctacacca acacaacaac tctggatttg ttggctacct tgcccccacc atgcgcgaag .22500 
gacaggccta ccctgctaac ttcccctatc cgcttatagg caagaccgca gttgacagca 22560 
ttacccagaa aaagtttctt tgcgatcgca ccctttggcg catcccattc tccagtaact 22620 
ttatgtccat gggcgcactc acagacctgg gccaaaacct tctctacgcc aactccgccc 22680 
acgcgctaga -catgactttt gaggtggatc ccatggacga gcccaccctt ctttatgttt 22^4-01., 
tgt'fctgaagt-ctttgacgtg gtccgtgtgc accggccgca ccgcggcgtc atcgaaaccg^22a'0^-k: 
'tgtacctr§'cg' cacgcccttc tcggccggca acgccacaac ataaagaagc aagcaacatcr5^60-^-. 
'aacaVcagct gccgccatgg gctccagtga gcaggaactg aaagccattg tcaaagatct ? ?22f!32tf f' 
tggttgtggg ccatattttt tgggcaccta tgacaagcgc tttccaggct ttgtttctcc ^2.980 ^ 
acacaagctc gcctgcgcca tagtcaatac ggccggtcgc gagactgggg gcgtacactg 23040 
gatggccttt gcctggaacc cgcactcaaa aacatgctac ctctttgagc cctttggctt 23100 
ttctgaccag cgactcaagc aggtttacca gtttgagtac gagtcactcc tgcgccgtag 23160 
cgccattgct tcttcccccg accgctgtat aacgctggaa aagtccaccc aaagcgtaca 23220 
ggggcccaac tcggccgcct gtggactatt ctgctgcatg tttctccacg cctttgccaa 23280 
ctggccccaa actcccatgg atcacaaccc caccatgaac cttattaccg gggtacccaa 23340 
ctccatgctc aacagtcccc aggtacagcc caccctgcgt cgcaaccagg aacagctcta 23400 
cagcttcctg gagcgccact cgccctactt ccgcagccac agtgcgcaga ttaggagcgc 23460 
cacttctttt tgtcacttga aaaacatgta aaaataatgt actagagaca ctttcaataa 23520 
aggcaaatgc ttttatttgt acactctcgg gtgattattt acccccaccc ttgccgtctg 23580 
cgccgtttaa aaatcaaagg ggttctgccg cgcatcgcta tgcgccactg gcagggacac 23640 
gttgcgatac tggtgtttag tgctccactt aaactcaggc acaaccatcc gcggcagctc 23700 
ggtgaagttt tcactccaca ggctgcgcac catcaccaac gcgtttagca ggtcgggcgc 23760 
cgatatcttg aagtcgcagt tggggcctcc gccctgcgcg cgcgagttgc gatacacagg 23820 
gttgcagcac tggaacacta tcagcgccgg gtggtgcacg ctggccagca cgctcttgtc 23880 
ggagatcaga tccgcgtcca ggtcctccgc gttgctcagg gcgaacggag tcaactttgg 23940 
tagctgcctt cccaaaaagg gcgcgtgccc aggctttgag ttgcactcgc accgtagtgg 24000 
catcaaaagg tgaccgtgcc cggtctgggc gttaggatac agcgcctgca taaaagcctt 24060 
gatctgctta aaagccacct gagcctttgc gccttcagag aagaacatgc cgcaagactt 24120 
gccggaaaac tgattggccg gacaggccgc gtcgtgcacg cagcaccttg cgtcggtgtt 24180 
ggagatctgc accacatttc ggccccaccg gttcttcacg atcttggcct tgctagactg 24240 
ctccttcagc gcgcgctgcc cgttttcgct cgtcacatcc atttcaatca cgtgctcctt 24300 
atttatcata atgcttccgt gtagacactt aagctcgcct tcgatctcag cgcagcggtg 24360 
cagccacaac gcgcagcccg tgggctcgtg atgcttgtag gtcacctctg caaacgactg 24420 
caggtacgcc tgcaggaatc gccccatcat cgtcacaaag gtcttgttgc tggtgaaggt 24480 
cagctgcaac ccgcggtgct cctcgttcag ccaggtcttg catacggccg ccagagcttc 24540 
cacttggtca ggcagtagtt tgaagttcgc ctttagatcg ttatccacgt ggtacttgtc 24600 
catcagcgcg cgcgcagcct ccatgccctt ctcccacgca gacacgatcg gcacactcag 24660 
cgggttcatc accgtaattt cactttccgc ttcgctgggc tcttcctctt cctcttgcgt 24720 
ccgcatacca cgcgccactg ggtcgtcttc attcagccgc cgcactgtgc gcttacctcc 24780 
tttgccatgc ttgattagca ccggtgggtt gctgaaaccc accatttgta gcgccacatc 24840 
ttctctttct tcctcgctgt ccacgattac ctctggtgat ggcgggcgct cgggcttggg 24900 
agaagggcgc ttctttttct tcttgggcgc aatggccaaa tccgccgccg aggtcgatgg 24960 
ccgcgggctg ggtgtgcgcg gcaccagcgc gtcttgtgat gagtcttcct cgtcctcgga 25020 
ctcgatacgc cgcctcatcc gcttttttgg gggcgcccgg ggaggcggcg gcgacgggga 25080 
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cggggacgac acgtcctcca tggttggggg acgtcgcgcc gcaccgcgtc cgcgct'cggg 25140 
ggtggtttcg cgctgctcct cttcccgact ggccatttcc ttctcctata ggcagaaaaa 25200 
gatcatggag tcagtcgaga agaaggacag cctaaccgcc ccctctgagt tcgccaccac 25260 
cgcctccacc gatgccgcca acgcgcctac caccttcccc gtcgaggcac ccccgcttga 25320 
'ggaggaggaa gtgattatcg agcaggaccc aggttttgta ag'cgaagacg acgaggaccg 25380 
ctcagtacca acagaggata aaaagcaaga ccaggacaac gcagaggcaa acgaggaaca 2544 0 
agtcgggcgg ggggacgaaa ggcatggcga ctacctagat gtgggagacg acgtgctgtt 25500 
gaagcatctg cagcgccagt gcgccattat ctgcgacgcg ttgcaagagc gcagcgatgt 25560 
gcccctcgcc atagcggatg tcagccttgc ctacgaacgc cacctattct caccgcgcgt 25620 
accccccaaa cgccaagaaa acggcacatg cgagcccaac ccgcgcctca acttctaccc 25680 
cgtatttgcc gtgccagagg tgcttgccac ctatcacatc tttttccaaa actgcaagat 2574 0 
acccctatcc tgccgtgcca accgcagccg agcggacaag cagctggcct tgcggcaggg 25800 
cgctgtcata cctgatatcg cctcgctcaa cgaagtgcca aaaatctttg agggtcttgg 25860 
acgcgacgag aagcgcgcgg caaacgctct gcaacaggaa aacagcgaaa atgaaagtca 25920 
ctctggagtg ttggtggaac tcgagggtga caacgcgcgc ctagccgtac taaaacgcag 25980 
catcgaggtc acccactttg cctacccggc acttaaccta ccccccaagg tcatgagcac 26040 
agtcatgagt gagctgatcg tgcgccgtgc gcagcccctg gagagggatg caaatttgca 26100 
agaacaaaca gaggagggcc tacccgcagt tggcgacgag cagctagcgc gctggcttca 26160 
aacgcgcgag cctgccgact tggaggagcg acgcaaacta atgatggccg cagtgctcgt 26220 
taccgtggag cttgagtgca tgcagcggtt- ctttgctgac ccggagatgc agcgcaagct 26280 
agaggaaaca ttgcactaca cctttcgaca gggctacgta cgccaggcct gcaagatctc 26340 
caacgtggag ctctgcaacc tggtctccta' ccttggaatt ttgcacgaaa accgccttgg 26400 
gcaaaacgtg cttcattcca cgctcaaggg. cgaggcgcgc cgcgactacg tccgcgactg 26460 
cgtttactta tttctatgct acacctggca^gacggccatg ggcgtttggc agcagtgctt 26520 
ggaggagtgc aacctcaagg agctgcagaa^actgctaaag caaaacttga aggacctatg 26580 
gacggccttc aacgagcgct ccgtggccgc [geacctggcg gacatcattt tccccgaacg 26640 
cctgcttaaa accctgcaac agggtctgc'c agacttcacc agtcaaagca tgttgcagaa 26700 
ctttaggaac tttatcctag agcgctcagg aatcttgccc gccacctgct gtgcacttcc 26760 
tagcgacttt gtgcccatta agtaccgcga atgccctccg ccgctt.tggg gccactgcta 26820 
ccttctgcag ctagccaact accttgccta ccactctgac ataatggaag acgtgagcgg 26880 
tgacggtcta ctggagtgtc actgtcgctg caacctatgc accccgcacc gctccctggt 26940 
ttgcaattcg cagctgctta acgaaagtca aattatcggt acctttgagc tgcagggtcc 27000 
ctcgcctgac gaaaagtccg cggctccggg gttgaaactc actccggggc tgtggacgtc 27060 
ggcttacctt cgcaaatttg tacctgagga ctaccacgcc cacgagatta ggttctacga 27120 
agaccaatcc cgcccgccaa atgcggagct taccgcctgc gtcattaccc agggccacat 27180 
tcttggccaa ttgcaagcca tcaacaaagc ccgccaagag tttctgctac gaaagggacg 27240 
gggggtttac ttggaccccc agtccggcga ggagctcaac ccaatccccc cgccgccgca 27300 
gccctatcag cagcagccgc gggcccttgc ttcccaggat ggcacccaaa aagaagctgc 27360 
agctgccgcc gccacccacg gacgaggagg aatactggga cagtcaggca gaggaggttt 27420 
tggacgagga ggaggaggac atgatggaag actgggagag cctagacgag gaagcttccg 27480 
aggtcgaaga ggtgtcagac gaaacaccgt caccctcggt cgcattcccc tcgccggcgc 27540 
cccagaaatc ggcaaccggt tccagcatgg ctacaacctc cgctcctcag gcgccgccgg 27600 
cactgcccgt tcgccgaccc aaccgtagat gggacaccac tggaaccagg gccggtaagt 27660 
ccaagcagcc gccgccgtta gcccaagagc aacaacagcg ccaaggctac cgctcatggc 27720 
gcgggcacaa gaacgccata gttgcttgct tgcaagactg tgggggcaac atctcctztcg 27780 
cccgccgctt tcttctctac catcacggcg tggccttccc ccgtaacatc ctgcattact 27840 
accgtcatct ctacagccca tactgcaccg gcggcagcgg cagcggcagc aacagcagcg 27900 
gccacacaga agcaaaggcg accggatagc aagactctga caaagcccaa gaaatccaca 27960 
gcggcggcag cagcaggagg aggagcgctg cgtctggcgc ccaacgaacc cgtatcgacc 28020 
cgcgagctta gaaacaggat ttttcccact ctgtatgcta tatttcaaca gagcaggggc 28080 
caagaacaag agctgaaaat aaaaaacagg tctctgcgat ccctcacccg cagctgcctg 28140 
tatcacaaaa gcgaagatca gcttcggcgc acgctggaag acgcggaggc tctcttcagt 28200 
aaatactgcg cgctgactct taaggactag tttcgcgccc tttctcaaat ttaagcgcga 28260 

aaarl-arnhn ahrhrrannrf anna nanr* net cinrtnnannan r-hrrf- nnf- nan: nnnnai- 1- at-n 2 fl 
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ttcacgcctc gtcaggcaat cctaactctg 
ggcattggaa ctctgcaatt tattgaggag 
tcgggacctc ccggccacta tccggatcaa 
tcggcggacg gctacgactg aatgttaagt 
ctggtccact gtcgccgcca caagtgcttt 
gaattgcccg aggatcatat cgagggcccg 
gagcttgccc gtagcctgat tcgggagttt 
aggggaccct gtgttctcac tgtgatttgc 
ttaattaatt gccacatcct cttacacttt 
ttgtgttatg tttcaacgtg tttatttttc 
tcagtagtat agccccacca ccacatagct 
acagaaccct agtattcaac ctgccacctc 
ctccccggct ggccttaaaa agcatcatat 
tattccacac ggtttcctgt cgagccaaac 
gcagctcact taagttcatg tcgctgtcca 
gcggttgctt aacgggcggc gaaggagaag 
cgtgcatcag gatagggcgg tggtgctgca 
gctccgtcct gcaggaatac aacatggcag 
gcagcataag gcgccttgtc ctccgggcac 
cacagtaact gcagcacagc accacaatat 
atccaaagct catggcgggg accacagaac 
vagattaagtg gcgacccctc ataaacacgc 
' V ..tgtaattcac cacctcccgg taccatataa 

* A V ccatcctaaa ccagctggcc aaaacctgcc 
tggaacaatg acagtggaga gcccaggact 

f^ZS** tatcaatgtt ggcacaacac aggcacacgt 

o <^ cccgcgttag aaccatatcc cagggaacaa 
tgcagggaag acctcgcacg taactcacgt 
gcagcggatg atcctccagt atggtagcgc 
ccctactgta cggagtgcgc cgagacaacc 
atggaacgcc ggacgtagtc atatttcctg 
gatctgcgtc tccggtctcg ccgcttagat 
tctctcaaag catccaggcg ccccctggct 
gctgccctga taacatccac caccgcagaa 
. ttctgcgagt cacacacggg aggagcggga 
ttccaaaaga ttatccaaaa cctcaaaatg 
ggtggcgtgg tcaaactcta cagccaaaga 
aatggcttcc aaaaggcaaa cggccctcac 
agggtgaatc tcctctataa acattccagc 
tcgccacctt ctcaatatat ctctaagcaa 
aatctgctcc agagcgccct ccaccttcag 
tcaggttcct cacagacctg tataagattc 
tcccgtaggt cccttcgcag ggccagctga 
gcggccactt ccccgccagg aaccttgaca 
ctcggagcta tgctaaccag cgtagccccg 
aaaatgcaag gtgctgctca aaaaatcagg 
gtagtcatgc tcatgcagat aaaggcaggt 
catttttctc tcaaacatgt ctgcgggttt 
aacatttaaa cattagaagc ctgtcttaca 
cggactacgg ccatgccggc gtgaccgtaa 
accgacagct cctcggtcat gtccggagtc 
tgattcatcg gtcagtgcta aaaagcgacc 
gcgtagagac aacattacag cccccatagg 
cacataaaca cctgaaaaac cctcctgcct 
aacatacagc gcttcacagc ggcagcctaa 
tattaaaaaa acaccactcg acacggcacc 
caagtgcaga gcgagtatat ataggactaa 
aacacccaga aaaccgcacg cgaacctacg 
ttcctcaaat cgtcacttcc gttttcccac 
caattcccaa cacatacaag ttactccgcc 
cgccccgcgc cacgtcacaa actccacccc 
aaggtatatt attgatgatg 



cagacctcgt cctctgagcc gcgctctgga 2 8860 
tttgtgccat cggtctactt taaccccttc 28920 
tttattccta actttgacgc ggtaaaggac 28980 
ggagaggcag agcaactgcg cctgaaacac 29040 
gcccgcgact ccggtgagtt ttgctacttt 29100 
gcgcacggcg tccggcttac cgcccaggga 29160 
acccagcgcc ccctgctagt tgagcgggac 29220 
aactgtccta accttggatt acatcaagat 29280 
ttcatacatt gcccaagaat aaagaatcgt 29340 
aattgcagaa aatttcaagt catttttcat 29400 
tatacagatc accgtacctt^ aatcaaactc 29460 
cctcccaaca cacagagtac acagtccttt 29520 
catgggtaac agacatattc ttaggtgtta 29580 
gctcatcagt gatattaata aactccccgg 29640 
gctgctgagc cacaggctgc . tgtccaactt 29700 
tccacgccta catgggggta gagtcataat 29760 
gcagcgcgcg aataaactgc tgccgccgcc 29820 
tggtctcctc agcgatgatt cgcaccgccc 29880 
agcagcgcac cctgatctca cttaaatcag 29940 
tgttcaaaat cccacagtgc aaggcgctgt 30000 
ccacgtggcc atcataccac aagcgcaggt 30060 
tggacataaa cattacctct tttggcatgtA30120 
acctctgatt aaacatggcg ccatccacGa ; "j30180 
cgccggctat acactgcagg> gaaccgggac 30240 
cgtaaccatg gatcatcatg^ctcgtcatga 30300 
gcatacactt cctcaggatt^-acaagbtcct 30360 
cccattcctg aatcagcgta'aatcccacac 30420 
tgtgcattgt caaagtgtta cattcgggca 30480 
gggtttctgt ctcaaaagga ggtagacgat 30540^ 
gagatcgtgt tggtcgtagt gtcatgccaa 30600 
aagcaaaacc aggtgcgggc gtgacaaaca 30660 
cgctctgtgt agtagttgta gtatatccac 30720 
tcgggttcta tgtaaactcc ttcatgcgcc 30780 
taagccacac ccagccaacc tacacattcg 30840 
agagctggaa gaaccatgtt ttttttttta 30900 
aagatctatt aagtgaacgc gctcccctcc 30960 
acagataatg gcatttgtaa gatgttgcac 31020 
gtccaagtgg acgtaaaggc taaacccttc 31080 
accttcaacc atgcccaaat aattctcatc 31140 
atcccgaata ttaagtccgg ccattgtaaa 31200 
cctcaagcag cgaatcatga ttgcaaaaat 31260 
aaaagcggaa cattaacaaa aataccgcga 31320 
acataatcgt gcaggtctgc acggaccagc 31380 
aaagaaccca cactgattat gacacgcata 31440 
atgtaagctt tgttgcatgg gcggcgatat 31500 
caaagcctcg cgcaaaaaag aaagcacatc 31560 
aagctccgga accaccacag aaaaagacac 31620 
ctgcataaac acaaaataaa ataacaaaaa 31680 
acaggaaaaa caacccttat aagcataaga 31740 
aaaaactggt caccgtgatt aaaaagcacc 31800 
ataatgtaag actcggtaaa cacatcaggt 31860 
gaaatagccc gggggaatac atacccgcag 31920 
aggtataaca aaattaatag gagagaaaaa 31980 
aggcaaaata gcaccctccc gctccagaac 32040 
cagtcagcct taccagtaaa aaagaaaacc 32100 
agctcaatca gtcacagtgt aaaaaagggc 32160 
aaaatgacgt aacggttaaa gtccacaaaa 32220 
cccagaaacg aaagccaaaa aacccacaac 32280 
gttacgtaac ttcccatttt aagaaaacta 32340 
ctaaaaccta cgtcacccgc cccgttccca 32400 
ctcattatca tattggcttc aatccaaaat 32460 

32480 
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<210> 24 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 24 

ctcaacaatt gtggatccgt actcc , 25 

<210> 25 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 25 

gtgctcagca gatcttgcga ctgtg 25 
<210> 26 - :1> ^7 l: y. vfv 

<211> 25 < . l .> \ ' 

<212> DNA ** : . 

<213> Artificial- Sequence $ 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 26 

ggcgcgttcg gatccactct cttcc 25 

<210> 27 
<2il> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 27 

ctacatgcta ggcagatctc gttcggag 28 

<210> 28 
<211> 1240 
<212> DNA 
<213> adenovirus, 

<400> 28 

ggatccactc tcttccgcat cgctgtctgc gagggccagc tgttggggtg agtactccct 60 
ctgaaaagcg ggcatgactt ctgcgctaag attgtcagtt tccaaaaacg aggaggattt 120 
gatattcacc tggcccgcgg tgatgccttt gagggtggcc gcatccatct ggtcagaaaa 180 
gacaatcttt ttgttgtcaa gcttggtggc aaacgacccg tagagggcgt tggacagcaa 240 
cttggcgatg gagcgcaggg tttggttttt gtcgcgatcg gcgcgctcct tggccgcgat 300 
gtttagctgc acgtattcgc gcgcaacgca ccgccattcg ggaaagacgg tggtgcgctc 360 
gtcgggcacc aggtgcacgc gccaaccgcg gttgtgcagg gtgacaaggt caacgctggt 420 
ggctacctct ccgcgtaggc gctcgttggt ccagcagagg cggccgccct tgcgcgagca 480 
gaatggcggt agggggtcta gctgcgtctc gtccgggggg tctgcgtcca cggtaaagac 540 
cccgggcagc aggcgcgcgt cgaagtagtc tatcttgcat ccttgcaagt ctagcgcctg 600 
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ctgccatgcg cgggcggcaa gcgcgcgctc gtatgggttg agtgggggac cccatggcat 660 
ggggtgggtg agcgcggagg cgtacatgcc gcaaatgtcg taaacgtaga ggggctctct 720 
gagtattcca agatatgtag ggtagcatct tccaccgcgg atgctggcgc gcacgtaatc 780 
gtatagttcg tgcgagggag cgaggaggtc gggaccgagg ttgctacggg cgggctgctc 840 
tgctcggaag actatctgcc tgaagatggc atgtgagttg gatgatatgg ttggacgctg 900 
gaagacgttg aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga 960 
gtcgcgcagc ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag 1020 
ggtttccttg atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag 1080 
gacaaactct- tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg 1140 
agatccgtac tccgccgccg agggacctga gcgagtccgc atcgaccgga tcggaaaacc 1200 
tctcgagaaa ggcgtctaac cagtcacagt cgcaagatct 1240 

<210> 29 
<211> 8383 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV60 
<400> 29 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc ,tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct .gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg" aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacgggg.tc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc^tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gcttggtacc 900 
gagctcggat ccactctctt ccgcatcgct gtctgcgagg gccagctgtt ggggtgagta 960 
ctccctctga aaagcgggca tgacttctgc gctaagattg tcagtttcca aaaacgagga 1020 
ggatttgata ttcacctggc ccgcggtgat gcctttgagg gtggccgcat ccatctggtc 1080 
agaaaagaca atctttttgt tgtcaagctt ggtggcaaac gacccgtaga gggcgttgga 1140- 
cagcaacttg gcgatggagc gcagggtttg gtttttgtcg cgatcggcgc gctccttggc 1200 
cgcgatgttt agctgcacgt attcgcgcgc aacgcaccgc cattcgggaa agacggtggt 1260 
gcgctcgtcg ggcaccaggt gcacgcgcca accgcggttg tgcagggtga caaggtcaac 1320 
gctggtggct acctctccgc gtaggcgctc gttggtccag cagaggcggc cgcccttgcg 1380 
cgagcagaat ggcggtaggg ggtctagctg cgtctcgtcc ggggggtctg cgtccacggt 1440 
aaagaccccg ggcagcaggc gcgcgtcgaa gtagtctatc ttgcatcctt gcaagtctag 1500 
cgcctgctgc catgcgcggg cggcaagcgc gcgctcgtat gggttgagtg ggggacccca 1560 
tggcatgggg tgggtgagcg cggaggcgta catgccgcaa atgtcgtaaa cgtagagggg 1620 
ctctctgagt attccaagat atgtagggta gcatcttcca ccgcggatgc tggcgcgcac 1680 
gtaatcgtat agttcgtgcg agggagcgag gaggtcggga ccgaggttgc tacgggcggg 1740 
ctgctctgct cggaagacta tctgcctgaa gatggcatgt gagttggatg atatggttgg 1800 
acgctggaag acgttgaagc tggcgtctgt gagacctacc gcgtcacgca cgaaggaggc 1860 
gtaggagtcg cgcagcttgt tgaccagctc ggcggtgacc tgcacgtcta gggcgcagta 1920 
gtccagggtt tccttgatga tgtcatactt atcctgtccc ttttttttcc acagctcgcg 1980 
gttgaggaca aactcttcgc ggtctttcca gtactcttgg atcggaaacc cgtcggcctc 2040 
cgaacgagat ccgtactccg ccgccgaggg acctgagcga gtccgcatcg accggatcgg 2100 
aaaacctctc gagaaaggcg tctaaccagt cacagtcgca agatccaaga tgaagcgcgc 2160 
aagaccgtct gaagatacct tcaaccccgt gtatccatat gacacggaaa ccggtcctcc 2220 
aactgtgcct tttcttactc ctccctttgt atcccccaat gggtttcaag agagtccccc 2280 
tggggtactc tctttgcgcc tatccgaacc tctagttacc tccaatggca tgcttgcgct 2340 
caaaatgggc aacggcctct ctctggacga ggccggcaac cttacctccc aaaatgtaac 2400 
cactgtgagc ccacctctca aaaaaaccaa gtcaaacata aacctggaaa tatctgcacc 2460 
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cctcacagtt acctcagaag ccctaactgt ggctgccgcc gcacctctaa tggtcgcggg 2520 
caacacactc accatgcaat cacaggcccc gctaaccgtg cacgactcca aacttagcat 2580 
tgccacccaa ggacccctca cagtgtcaga aggaaagcta gccctgcaaa catcaggccc 2640 
cctcaccacc accgatagca gtacccttac tatcactgcc tcaccccctc taactactgc 2700 
cactggtagc ttgggcattg acttgaaaga gcccatttat acacaaaatg gaaaactagg 2760 
actaaagtac ggggctcctt tgcatgtaac agacgaccta aacactttga ccgtagcaac 2820 
tggtccaggt gtgactatta ataatacttc cttgcaaact aaagttactg gagccttggg 2880 
ttttgattca caaggcaata tgcaacttaa tgtagcagga ggactaagga ttgattctca 2940 
aaacagacgc cttatacttg atgttagtta tccgtttgat gctcaaaacc aactaaatct 3000 
aagactagga cagggccctc tttttataaa ctcagcccac aacttggata ttaactacaa 3060 
caaaggcctt tacttgttta cagcttcaaa caattccaaa aagcttgagg ttaacctaag 3120 
cactgccaag gggttgatgt .ttgacgctac agccatagcc attaatgcag gagatgggct 3180 
tgaatttggt tcacctaatg caccaaacac aaatcccctc aaaacaaaaa ttggccatgg 3240 
cctagaattt gattcaaaca aggctatggt tcctaaacta ggaactggcc ttagttttga 3300 
cagcacaggt gccattacag taggaaacaa aaataatgat aagctaactt tgtggaccac 3360 
accagctcca tctcctaact gtagactaaa tgcagagaaa gatgctaaac tcactttggt 3420 
cttaacaaaa tgtggcagtc aaatacttgc tacagtttca gttttggctg ttaaaggcag 3480 
tttggctcca atatctggaa cagttcaaag tgctcatctt attataagat ttgacgaaaa 3540 
tggagtgcta ctaaacaatt ccttcctgga cccagaatat tggaacttta gaaatggaga 3600 
tcttactgaa ggcacagcct atacaaacgc tgttggattt atgcctaacc tatcagctta 3660 
tccaaaatct cacggtaaaa ctgccaaaag taacattgtc agtcaagttt* acttaaacgg 3720 
agacaaaact aaacetgtaa cactaaccat tacactaaac ggtacacagg aaacaggaga 3780 
cacaactcca agtgcatact ctatgtcatt ttcatgggac tggtctggcc acaactacat 3840 
taatgaaata* :tttgccacat cctcttacac tttttcatac attgcccaag aataaaagaa 3900."- ??■ 
gcggccgfetc-'gagcatgcat ctagagggcc ctattctata gtgtcaccta aatgctagag 39.60-^j 
ctcgctgatc^agcctcgact" : gtgccttcta gttgccagcc atctgttgtt tgcccctccc 4Q2Q*^-?k 
ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg 4080 H > 
aaattgcatc gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg 4140 
acagcaaggg ggaggattgg . gaagacaata gcaggcatgc tggggatgcg gtgggctcta 4200 
tggcttctga ggcggaaaga accagctggg gctctagggg gtatccccac gcgccctgta 4260 
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 4320 
gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct 4380 
ttccccgtca agctctaaat cggggcatcc ctttagggtt ccgatttagt gctttacggc 4440 
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat 4500 
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 4560 
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgg 4620 
ggatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaattaat 4680 
tctgtggaat gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag gcaggcagaa 4740 
gtatgcaaag catgcatctc aattagtcag caaccaggtg tggaaagtcc ccaggctccc 4800 
cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccata gtcccgcccc 4860 
taactccgcc catcccgccc ctaactccgc ccagttccgc ccattctccg ccccatggct 4920 
gactaatttt ttttatttat gcagaggccg aggccgcctc tgcctctgag ctattccaga 4980 
agtagtgagg aggctttttt ggaggcctag gcttttgcaa aaagctcccg ggagcttgta 5040 
tatccatttt cggatctgat caagagacag gatgaggatc gtttcgcatg attgaacaag 5100 
atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc tatgactggg 5160 
cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc 5220 
cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag gacgaggcag 5280 
cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgtgctc gacgttgtca 5340 
ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat ctcctgtcat 5400 
ctcaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg cggctgcata 5460 
cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acatcgcatc gagcgagcac 5520 
gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag catcaggggc 5580 
tcgcgccagc cgaactgttc gccaggctca aggcgcgcat gcccgacggc gaggatctcg 5640 
tcgtgaccca tggcgatgcc tgcttgccga atatcatggt* ggaaaatggc cgcttttctg 5700 
gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta 5760 
cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc gtgctttacg 5820 
gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac gagttcttct 5880 
gagcgggact ctggggttcg aaatgaccga ccaagcgacg cccaacctgc catcacgaga 5940 
tttcgattcc accgccgcct tctatgaaag gttgggcttc ggaatcgttt tccgggacgc 6000 
cggctggatg atcctccagc gcggggatct catgctggag ttcttcgccc accccaactt 6060 
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 6120 
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 6180 
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tgtctgtata ccgtcgacct ctagctagag cttggcgtaa tcatggtcat agctgtttcc 6240 
tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg 6300 
taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 6360 
cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 6420 
gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 6480 
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 6540 
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 6600 
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 6660 
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 6720 
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 6780 
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta 6840 
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 6900 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 6960 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 7020 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 7080 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 7140 
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 7200 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 7260 
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 7320 
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 7380 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 7440 
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 7500 
tggccccagt gctgcaatga taccgcgaga- cccacgctca ccggctccag atttatcagc 7560 
aataaaccag ccagccggaa gggccgagcg t.cagaagtggt cctgcaactt tatccgcctc 7620 
catccagtct attaattgtt gccgggaagc.pt agagtaagt agttcgccag ttaatagttt 7680 * 
gcgcaacgtt gttgccattg ctacaggcat^cgtggtgtca cgctcgtcgt ttggtatggc 7740 
ttcattcagc tccggttccc aacgatcaag .gcgagttaca tgatccccca tgttgtgcaa 7800 
aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 7860 
atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 7920 
cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 7980 
gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 8040 
agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 8100 
gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 8160 
caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 8220 
ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 8280 
tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 8340 
aggggttccg cgcacatttc cccgaaaagt gccacctgac gtc 8383 



<210> 30 
<211> 7960 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; 



plasmid pDV67 



<400> 30 

gacggatcgg 

ccgcatagtt 

cgagcaaaat 

ttagggttag 

gattattgac 

tggagttccg 

cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 

gtaggcgtgt 

ctgcttactg 



gagatctccc 

aagccagtat 

ttaagctaca. 

gcgttttgcg 

tagttattaa 

cgttacataa 

gacgtcaata 

atgggtggac 

aagtacgccc 

catgacctta 

catggtgatg 

atttccaagt 

ggactttcca 

acggtgggag 

gcttatcgaa 



gatcccctat 
ctgctccctg 
acaaggcaag 
ctgcttcgcg 
tagtaatcaa 
cttacggtaa 
atgacgtatg 
tatttacggt 
cctattgacg 
tgggactttc 
cggttttggc 
ctccacccca 
aaatgtcgta 
gtctatataa. 
attaatacga 



ggtcgactct 
cttgtgtgtt 
gcttgaccga 
atgtacgggc 
ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 
gcagagctct 
ctcactatag 



cagtacaatc 
ggaggtcgct 
caattgcatg 
cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 
ctggctaact 
ggagacccaa 



tgctctgatg 60 
gagtagtgcg 120 
aagaatctgc 180 
cgttgacatt 240 
agcccatata 300 
cccaacgacc 360 
gggactttcc 420 
catcaagtgt 480 
gcctggcatt 540 
gtattagtca 600 
tagcggtttg 660 
ttttggcacc 720 
caaatgggcg 780 
agagaaccca 840 
gctggctagc 900 
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gtttaaactt aagcttggta ccgagctcgg 
gggccagctg ttggggtgag tactccctct 
tgtcagtttc caaaaacgag gaggatttga 
gggtggccgc atccatctgg tcagaaaaga 
acgacccgta gagggcgttg gacagcaact 
cgcgatcggc gcgctccttg gccgcgatgt 
gccattcggg aaagacggtg gtgcgctcgt 
tgtgcagggt gacaaggtca acgctggtgg 
agcagaggcg gccgcccttg cgcgagcaga 
ccggggggtc tgcgtccacg gtaaagaccc 
tcttgcatcc ttgcaagtct agcgcctgct 
atgggttgag tgggggaccc catggcatgg 
aaatgtcgta aacgtagagg ggctctctga 
caccgcggat gctggcgcgc acgtaatcgt 
gaccgaggtt gctacgggcg ggctgctctg 
gtgagttgga tgatatggtt ggacgctgga 
ccgcgtcacg cacgaaggag gcgtaggagt 
cctgcacgtc tagggcgcag tagtccaggg 
cctttttttt ccacagctcg cggttgagga 
ggatcggaaa cccgtcggcc tccgaacgag 
gagtccgcat cgaccggatc ggaaaacctc 
caagatccaa gatgaagcgc gcaagaccgt 
atgacacgga aaccggtcct ccaactgtgc 
^ atgggtttca agagagtccc cctggggtac 
^cctccaatgg catgcttgcg ctcaaaatgg 
; >accttacctc ccaaaatgta accactgtga 
^aaacctgga aatatctgca cccctcacag 
ccgcacctct aatggtcgcg ggcaacacac 
tgqacgactc caaacttagc attgccaccc 
tagccctgca aacatcaggc cccctcacca 
cctcaccccc tctaactact gccactggta 
atacacaaaa tggaaaacta ggactaaagt 
taaacacttt gaccgtagca actggtccag 
ctaaagttac tggagccttg-ggttttgatt 
gaggactaag gattgattct caaaacagac 
atgctcaaaa ccaactaaat ctaagactag 
acaacttgga tattaactac aacaaaggcc 
aaaagcttga ggttaaccta agcactgcca 
ccattaatgc aggagatggg cttgaatttg 
tcaaaacaaa aattggccat ggcctagaat 
taggaactgg ccttagtttt gacagcacag 
ataagctaac tttgtggacc acaccagctc 
aagatgctaa actcactttg gtcttaacaa 
cagttttggc tgttaaaggc agtttggctc 
ttattataag atttgacgaa aatggagtgc 
attggaactt tagaaatgga gatcttactg 
ttatgcctaa cctatcagct tatccaaaat 
tcagtcaagt ttacttaaac ggagacaaaa 
acggtacaca ggaaacagga gacacaactc 
actggtctgg ccacaactac attaatgaaa 
acattgccca agaataaaag aagcggccgc 
gatcagcctc gactgtgcct tctagttgcc 
cttccttgac cctggaaggt gccactccca 
catcgcattg tctgagtagg tgtcattcta 
agggggagga ttgggaagac aatagcaggc 
ctgaggcgga aagaaccagc tggggctcta 
cattaagcgc ggcgggtgtg gtggttacgc 
tagcgcccgc tcctttcgct ttcttccctt 
gtcaagctct aaatcggggc atccctttag 
accccaaaaa acttgattag ggtgatggtt 
tttttcgccc tttgacgttg gagtccacgt 
gaacaacact caaccctatc tcggtctatt 



atccactctc ttccgcatcg ctgtctgcga 960 
gaaaagcggg catgacttct gcgctaagat 1020 
tattcacctg gcccgcggtg atgcctttga 1080 
caatcttttt gttgtcaagc ttggtggcaa 1140 
tggcgatgga gcgcagggtt tggtttttgt 1200 
ttagctgcac gtattcgcgc gcaacgcacc 1260 
cgggcaccag gtgcacgcgc caaccgcggt 1320 
ctacctctcc gcgtaggcgc tcgttggtcc 1380 
atggcggtag ggggtctagc tgcgtctcgt 1440 
cgggcagcag gcgcgcgtcg aagtagtcta 1500 
gccatgcgcg ggcggcaagc gcgcgctcgt 1560 
ggtgggtgag cgcggaggcg tacatgccgc 1620 
gtattccaag atatgtaggg tagcatcttc 1680 
atagttcgtg cgagggagcg aggaggtcgg 1740 
ctcggaagac tatctgcctg aagatggcat 1800 
agac^ttgaa gctggcgtct gtgagaccta 1860 
cgcgcagctt gttgaccagc tcggcggtga 1920 
tttccttgat gatgtcatac ttatcctgtc 1980 
caaactcttc gcggtctttc cagtactctt 2040 
atccgtactc cgccgccgag ggacctgagc 2100 
tcgagaaagg cgtctaacca gtcacagtcg 2160 
ctgaagatac cttcaacccc gtgt ate cat 2:220 
cttttcttac tcctcccttt gtatccccca 2&80 
tetctttgeg cctatccgaa cctctagtta .2340 
gcaacggcct ctctctggac .-gaggGcggca-2400 : 
gcccacctct caaaaaaacc aagtcaaaca 2460 
ttacctcaga agccctaact gtggc.tgccg 2520 
tcaccatgca atcacaggcc ccgctaaccg 2580 
aaggacccct cacagtgtca gaaggaaagc 2640 
ccaccgatag cagtaccctt actatcactg 2700 
gcttgggcat tgacttgaaa gageccattt 2760 
acggggctcc tttgcatgta acagacgacc 2820 
gtgtgactat taataatact tecttgeaaa 2880 
cacaaggcaa tatgeaaett aatgtagcag 2 940 
gecttatact tgatgttagt tatccgtttg 3000 
gacagggccc tctttttata aactcagccc 3060 
tttacttgtt tacagcttca aacaattcca 3120 
aggggttgat gtttgacget acagecatag 3180 
gttcacctaa tgcaccaaac acaaatcccc 3240 
ttgattcaaa caaggctatg gttcctaaac 3300 
gtgecattae agtaggaaac aaaaataatg 3360 
catctcctaa ctgtagacta aatgeagaga 3420 
aatgtggcag tcaaatactt gctacagttt 3480 
caatatctgg aacagttcaa agtgctcatc 3540 
tactaaacaa ttccttcctg gacccagaat 3600 
aaggcacagc ctatacaaac gctgttggat 3660 
etcaeggtaa aactgecaaa agtaacattg 3720 
ctaaacctgt aacactaacc attacactaa 3780 
caagtgeata ctctatgtca ttttcatggg 3840 
tatttgecac atcctcttac actttttcat 3900 
tcgagtctag agggecegtt taaacccgct 3960 
agecatctgt tgtttgcccc tcccccgtgc 4020 
ctgtcctttc ctaataaaat gaggaaattg 4080 
ttctgggggg tggggtgggg caggacagca 4140 
atgctgggga tgcggtgggc tctatggctt 4200 
gggggtatcc ccacgcgccc tgtagcggcg 4260 
geagegtgae cgctacactt gccagcgccc 4320 
cctttctcgc cacgttcgcc ggctttcccc 4380 
ggttccgatt tagtgettta cggcacctcg 4440 
cacgtagtgg gccatcgccc tgatagaegg 4500 
tctttaatag tggactcttg ttccaaactg 4560 
cttttgattt ataagggatt ttggggattt 4620 
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cggcctattg gttaaaaaat gagctgattt 
gaatgtgtgt cagttagggt gtggaaagtc 
aaagcatgca tctcaattag tcagcaacca 
gcagaagtat gcaaagcatg catctcaatt 
cgcccatccc gcccctaact ccgcccagtt 
ttttttttat ttatgcagag gccgaggccg 
gaggaggctt ttttggaggc ctaggctttt 
ttttcggatc tgatcagcac gtgttgacaa 
ataatacgac aaggtgagga actaaaccat 
caccgcgcgc gacgtcgccg gagcggtcga 
ggacttcgtg gaggacgact tcgccggtgt 
cgcggtccag gaccaggtgg tgccggacaa 
ggacgagctg tacgccgagt ggtcggaggt 
gccggccatg accgagatcg gcgagcagcc 
ggccggcaac tgcgtgcact tcgtggccga 
cgattccacc gccgccttct atgaaaggtt 
ctggatgatc ctccagcgcg gggatctcat 
tattgcagct tataatggtt acaaataaag 
atttttttca ctgcattcta gttgtggttt 
ctgtataccg tcgacctcta gctagagctt 
gtgaaattgt tatccgctca caattccaca 
agcctggggt gcctaatgag fcgagctaact 
tttccagtcg ggaaacctgt cgtgccagct 
aggcggtttg cgtattgggc^gctcttccgc 
cgttcggctgu.cggcgagcgg- tatcagctca 
atcaggggat .laacgcaggaa agaacatgtg 
taaaaaggcc gcgttgctgg cgtttttcca 
aaatcgacgc tcaagtcaga ggtggcgaaa 
tccccctgga agctccctcg tgcgctctcc 
gtccgccttt ctcccttcgg gaagcgtggc 
cagttcggtg taggtcgttc gctccaagct 
cgaccgctgc gccttatccg gtaactatcg 
atcgccactg gcagcagcca ctggtaacag 
tacagagttc ttgaagtggt ggcctaacta 
ctgcgctctg ctgaagccag ttaccttcgg 
acaaaccacc gctggtagcg gtggtttttt 
aaaaggatct caagaagatc ctttgabctt 
aaactcacgt taagggattt tggtcatgag 
tttaaattaa aaatgaagtt ttaaatcaat 
cagttaccaa tgcttaatca gtgaggcacc 
catagttgcc tgactccccg tcgtgtagat 
ccccagtgct gcaatgatac cgcgagaccc 
aaaccagcca gccggaaggg ccgagcgcag 
ccagtctatt aattgttgcc gggaagctag 
caacgttgtt gccattgcta caggcatcgt 
attcagctcc ggttcccaac gatcaaggcg 
agcggttagc tccttcggtc ctccgatcgt 
actcatggtt atggcagcac tgcataattc 
ttctgtgact ggtgagtact caaccaagtc 
ttgctcttgc ccggcgtcaa tacgggataa 
gctcatcatt ggaaaacgtt cttcggggcg 
atccagttcg atgtaaccca ctcgtgcacc 
cagcgtttct gggtgagcaa aaacaggaag 
gacacggaaa tgttgaatac tcatactctt 
gggttattgt ctcatgagcg gatacatatt 
ggttccgcgc acatttcccc gaaaagtgcc 

<210> 31 
<211> 30 
<2X2> DNA 

<213> Artificial Sequence 



aacaaaaatt taacgcgaat taattctgtg 4680 
cccaggctcc ccaggcaggc agaagtatgc 4740 
ggtgtggaaa gtccccaggc tccccagcag 4800 
agtcagcaac catagtcccg cccctaactc 4860 
ccgcccattc tccgccccat ggctgactaa 4920 
cctctgcctc tgagctattc cagaagtagt 4980 
gcaaaaagct cccgggagct tgtatatcca 5040 
ttaatcatcg gcatagtata tcggcatagt 5100 
ggccaagttg accagtgccg ttccggtgct 5160 
gttctggacc gaccggctcg ggttctcccg 5220 
ggtccgggac gacgtgaccc tgttcatcag 5280 
caccctggcc tgggtgtggg tgcgcggcct 5340 
cgtgtccacg aacttccggg acgcctccgg 5400 
gtgggggcgg gagttcgccc tgcgcgaccc 5460 
ggagcaggac tgacacgtgc tacgagattt 5520 
gggcttcgga. atcgttttcc gggacgccgg 5580 
gctggagttc ttcgcccacc ccaacttgtt 5640 
caatagcatc acaaatttca caaataaagc 5700 
gtccaaactc atcaatgtat cttatcatgt 5760 
ggcgtaatca tggtcatagc tgtttcctgt 5820~ 
caacatacga gccggaagca taaagtgtaa 5880 
cacattaatt gcgttgcgct cactgcccgc 5940 
gcattaatga atcggccaac gcgcggggag 6000 
ttcctcgctc actgactcgc tgcgctcggt 6060 
ctcaaaggcg gtaatacggt tatccacaga 6120 
agcaaaaggc cagcaaaagg ccaggaaccg 6180 
taggctccgc ccccctgacg agcatcacaa 6240 
cc cgacagga ctataaagat accaggcgtt 6300 
tgttccgacc ctgccgctta ccggatacct 6360 
gctttctcaa tgctcacgct gtaggtatct 642 0 
gggctgtgtg cacgaacccc ccgttcagcc 6480 
tcttgagtcc aacccggtaa gacacgactt 6540 
gattagcaga gcgaggtatg taggcggtgc 6600 
cggctacact agaaggacag tatttggtat 6660 
aaaaagagtt ggtagctctt gatccggcaa 6720 
tgtttgcaag cagcagatta cgcgcagaaa 6780 
ttctacgggg tctgacgctc agtggaacga 684 0 
attatcaaaa aggatcttca cctagatcct 6900 
ctaaagtata tatgagtaaa cttggtctga 6960 
tatctcagcg atctgtctat ttcgttcatc 7020 
aactacgata cgggagggct taccatctgg 7080 
acgctcaccg gctccagatt tatcagcaat 7140 
aagtggtcct gcaactttat ccgcctccat 7200 
agtaagtagt tcgccagtta atagtttgcg 7260 
ggtgtcacgc tcgtcgtttg gtatggcttc 7320 
agttacatga tcccccatgt tgtgcaaaaa 7380 
tgtcagaagt aagttggccg cagtgttatc 7440 
tcttactgtc atgccatccg taagatgctt 7500 
attctgagaa tagtgtatgc ggcgaccgag 7560 
taccgcgcca catagcagaa ctttaaaagt 7620 
aaaactctca aggatcttac cgctgttgag 7680 
caactgatct tcagcatctt ttactttcac 7740 
gcaaaatgcc gcaaaaaagg gaataagggc 7800 
cctttttcaa tattattgaa gcatttatca 7860 
tgaatgtatt tagaaaaata aacaaatagg 7920 
acctgacgtc 7960 
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<220> 

<223> Description of Artificial Sequence: primer 



<400> 31 

atgggatcca agatgaagcg cgcaagaccg 

<210> 32 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 32 

cactatagcg gccgcattct cagtcatctt 30 

<210> 33 
<211> 7989 
<212> DNA 

<213> Artificial Sequence 

<220> , .* , 

<223> Description of Artificial Sequence,: plasmid. pDV69 

<400> 33 

gacggatcgg gagatctccc gatcccctat ggtcgactct ' cagtacaatc , tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca- 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta .acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 
gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960 
gggccagctg ttggggtgag tactccctct gaaaagcggg catgacttct gcgctaagat 1020 
tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 1080 
gggtggccgc atccatctgg tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa 1140 
acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 1200 
cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 1260 
gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 1320 
tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc 1380 
agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 1440 
ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 1500 
tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt 1560 
atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 1620 
aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 1680 
caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg 1740 
gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 1800 
gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 1860 
ccgcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 1920 
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980 
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 2040 
ggatcggaaa cccgtcggcc tccgaacgag atccgtactc cgccgccgag ggacctgagc 2100 
gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg 2160 
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caagatccaa gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 2220 
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 2280 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 2340 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 2400 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 2460 
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 2520 
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 2580 
tgcacgactc caaacttagc . attgccaccc. aaggacccct cacagtgtca gaaggaaagc 2640 
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 2700 
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 2760 
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 2 820 
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 2 880 
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 2940 
gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 3000 
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 3 060 
acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 3120 
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgct acagccatag 3180 
ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 3240 
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 3300 
taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 3360 
ataagctaac tttgtggacc ggtccaaaac cagaagccaa ctgcataatt gaatacggga 3420 
aacaaaaccc agatagcaaa ctaactttaa tccttgtaaa aaatggagga attgttaatg 3480 
gatatgtaac gctaatggga gcctcagact acgttaacac cttatttaaa aacaaaaatg 3540 
tctccattaa tgtagaacta tactttgatg ccactggtca tatattacca gactcatctfc 360.0 
; ctcttaaaac agatctagaa ctaaaataca agcaaaccgc tgactttagt gcaagaggtt . 3660 . ;< 
ttatgccaag tactacagcg tatccatttg tccttcctaa tgcgggaaca cataatgaaa -3720 
attatatttt tggtcaatgc tactacaaag caagcgatgg tgcccttttt ccgttggaag 3780- 
ttactgttat gcttaataaa cgcctgccag atagtcgcac atcctatgtt atgacttttt 3 840 
tatggtcctt gaatgctggt ctagctccag aaactactca ggcaaccctc ataacctccc 3900 
catttacctt ttcctatatt agagaagatg actgattttt aagaagcggc cgctcgagtc 3960 
tagagggccc gtttaaaccc gctgatcagc ctcgactgtg ccttctagtt gccagccatc 4020 
tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct 4080 
ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg 4140 
999 fc 9999tg gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg 4200 
ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc snccntagct ggggctctag 4260 
ggggtatccc cacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 4320 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 43 80 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcggggca tccctttagg 4440 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 4500 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 4560 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 4620 
ttttgattta taagggattt tggggatttc ggcctattgg ttaaaaaatg agctgattta 4680 
acaaaaattt aacgcgaatt aattctgtgg aatgtgtgtc agttagggtg tggaaagtcc 4740 
ccaggctccc caggcaggca gaagtatgca aagcatgcat ctcaattagt cagcaaccag 4800 
gtgtggaaag tccccaggct ccccagcagg cagaagtatg caaagcatgc atctcaatta 4 860 
gtcagcaacc atagtcccgc ccctaactcc gcccatcccg cccctaactc cgcccagttc 4920 
cgcccattct ccgccccatg gctgactaat tttttttatt tatgcagagg ccgaggccgc 4980 
ctctgcctct gagctattcc agaagtagtg aggaggcttt tttggaggcc taggcttttg 5040 
caaaaagctc ccgggagctt gtatatccat tttcggatct gatcagcacg tgttgacaat 5100 
taatcatcgg catagtatat cggcatagta taatacgaca aggtgaggaa ctaaaccatg 5160 
gccaagttga ccagtgccgt tccggtgctc accgcgcgcg acgtcgccgg agcggtcgag 5220 
ttctggaccg accggctcgg gttctcccgg gacttcgtgg aggacgactt cgccggtgtg 5280 
gtccgggacg acgtgaccct gt teat cage geggtccagg accaggtggt gccggacaac 5340 
accctggcct gggtgtgggt gcgcggcctg gacgagctgt aegecgagtg gtcggaggtc 5400 
gtgtccacga acttceggga cgcctccggg ccggccatga ccgagategg cgagcagccg 5460 
t 99999cggg agttcgccct gcgcgacccg gccggcaact gcgtgcactt cgtggccgag 5520 
gagcaggact gaeaegtget acgagatttc gattccaccg ccgccttcta tgaaaggttg 5580 
ggcttcggaa tcgttttccg ggacgccggc tggatgatcc tccagcgcgg ggatctcatg 5640 
ctggagttct tcgcccaccc caacttgttt attgeagett ataatggtta caaataaagc 5700 
aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag ttgtggtttg 5760 
tccaaactca tcaatgtatc ttatcatgtc tgtataccgt cgacctctag ctagagcttg 5820 
gegtaatcat ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac 5880 
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aacatacgag ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc 5940 
acattaattg cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg 6000 
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 6060 
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 6120 
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 6180 
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 6240 
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 6300 
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 6360 
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 6420 
ctttctcaat gctcacgctg taggtatctc agttcggtgt~ aggtcgttcg ctccaagctg 6480 
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 6540 
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 6600 
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 6660 
ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 6720 
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 6780 
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt ,6840 
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 6900 
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 6960 
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 7020 
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata 7080 
actacgatac gggagggctt, accatctggc cccagtgctg caatgatacc gcgagaccca 7140 
cgctcaccgg ctccagattt atcagcaata aaccagccag- ccggaagggc cgagcgcaga 7200 
agtggtcctg caactttatc cgcctccatc cagtctatta'' attgttgccg ggaagctaga 7260 
gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg 7320 
gtgtcacgct - cgtcgtttgg tatggcttca' ttcagctccg gttcccaacg atcaaggcga 7380 
gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt 7440 
gtcagaagta agttggccgc agtgttatca ctcafcggtta tggcagcact gcataattct 7500 
cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca 7560 
ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat 7620 
accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga 7680 
aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc 7740 
aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg 7800 
caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc 7860 
ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt 7920 
gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca 7980 
cctgacgt c * 7989 

<210> 34 
<211> 7607 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid GRE5-El-SV40-Hygro 
<400> 34 

tctagaagat ccgctgtaca ggatgttcta gctactttat tagatccgct gtacaggatg 60 
ttctagctac tttattagat ccgctgtaca ggatgttcta gctactttat tagatccgct 120 
gtacaggatg ttctagctac tttattagat ccgtgtacag gatgttctag ctactttatt 180 
agatcgatct cctggccgtt cggggtcaaa aaccaggttt ggctataaaa gggggtgggg 240 
gcgcgttcgt cctcactctc ttccgcatcg ctgtctgcga gggccaggat cgatcctgag 300 
aacttcaggg tgagtttggg gacccttgat tgttctttct ttttcgctat tgtaaaattc 360 
atgttatatg gagggggcaa agttttcagg gtgttgttta gaatgggaag atgtcccttg 420 
tatcaccatg gaccctcatg ataattttgt ttctttcact ttctactctg ttgacaacca 480 
ttgtctcctc ttattttctt ttcattttct gtaacttttt cgttaaactt tagcttgcat 540 
ttgtaacgaa tttttaaatt cacttttgtt tatttgtcag attgtaagta ctttctctaa 600 
tcactttttt ttcaaggcaa tcagggtata ttatattgta cttcagcaca gttttagaga 660 
acaattgtta taattaaatg ataaggtaga atatttctgc atataaattc tggctggcgt 720 
ggaaatattc ttattggtag aaacaactac atcctggtca tcatcctgcc tttctcttta 780 
tggttacaat gatatacact gtttgagatg aggataaaat actctgagtc caaaccgggc 840 
ccctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 900 
gttattgtgc tgtctcatca ttttggcaaa gaatt agate taagcttctg cagctcgagg 960 
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actcggtcga ctgaaaatga gacatattat 
ggccgccagt cttttggacc agctgatcga 
tagccatttt gaaccaccta cccttcacga 
agatcccaac gaggaggcgg tttcgcagat 
ggaagggatt gacttactca cttttccgcc 
ttcccggcag cccgagcagc cggagcagag 
tgtaccggag gtgatcgatc ttacctgcca 
ggatgaagag ggtgaggagt ttgtgttaga 
gtcttgtcat tatcaccgga ggaatacggg 
tatgaggacc tgtggcatgt ttgtctacag 
agtggtgggt ttggtgtggt aatttttttt 
ttttgtattg tgattttttt aaaaggtcct 
gaaccggagc ctgcaagacc tacccgccgt 
ccgacatcac ctgtgtctag agaatgcaat 
tctaacacac ctcctgagat acacccggtg 
gtgagagttg gtgggcgtcg ccaggctgtg 
gggcaacctt tggacttgag ctgtaaacgc 
cgtgtgtggt taacgccttt gtttgctgaa 
ataatgttta acttgcatgg cgtgttaaat 
cgtgggctaa tcttggttac atctgacctc 
tctgctgtgc gtaacttgct ggaacagagc 
ctgtggggct catcccaggc aaagttagtc 
tttgaagagc ttttgaaatc ctgtggtgag 
gcgcttttcc aagagaaggt catcaagact 
gctgctgttg cttttttgag ttttataaag 
ggggggtacc tgctggattt tctggccatg 
: aatcgcctgc tactgttgtc ttccgtccgc 
cagcagcagg aggaagccag gcggcggcgg 
ggcctggacc ctcgggaatg aatgttgtac 
gcattttgac aattacagag gatgggcagg 
cttgtgaggc tacagaggag gctaggaatc 
ctgagtgtat tacttttcaa cagatcaagg 
cgcagaagta ttccatagag cagctgacca 
aggaggctat tagggtatat gcaaaggtgg 
gcaaacttgt aaatatcagg aattgttgct 
tagatacgga ggatagggtg gcctttagat 
ttggcatgga cggggtggtt attatgaatg 
cggttttcct ggccaatacc aaccttatcc 
atacctgtgt ggaagcctgg accgatgtaa 
ggaagggggt ggtgtgtcgc cccaaaagca 
ggtgtacctt gggtatcctg tctgagggta 
actgtggttg cttcatgcta gtgaaaagcg 
gcaactgcga ggacagggcc tctcagatgc 
tgaagaccat tcacgtagcc agccactctc 
tactgacccg ctgttccttg catttgggta 
gcaatttgag tcacactaag atattgcttg 
acggggtgtt tgacatgacc atgaagatct 
ccaggtgcag accctgcgag tgtggcggta 
atgtgaccga ggagctgagg cccgatcact 
gctctagcga tgaagataca gattgaggta 
gaaagaatat ataaggtggg ggtcttatgt 
ccgccatgag caccaactcg tttgatggaa 
tgcccccatg ggccggggtg cgtcagaatg 
tcctgcccgc aaactctact accttgacct 
ctgcagcctc cgccgccgct tcagccgctg 
ttgctttcct gagcccgctt gcaagcagtg 
agttgacggc tcttttggca caattggatt 
agcagctgtt ggatctgcgc cagcaggttt 
cggtttaaaa cataaataaa aaaccagact 
gctgtctcag ctgactgctt aagtcgcaag 
ttaaagcaga acttgtttat tgcagcttat 
aatttcacaa ataaagcatt tttttcactg 



ctgccacgga ggtgttatta ccgaagaaat 1020 
agaggtactg gctgataatc ttccacctcc 1080 
actgtatgat ttagacgtga cggcccccga 1140- 
ttttcccgac tctgtaatgt tggcggtgca 1200 
ggcgcccggt tctccggagc cgcctcacct 1260 
agccttgggt ccggtttcta tgccaaacct 1320 
cgaggctggc tttccaccca gtgacgacga 1380 
ttatgtggag caccccgggc acggttgcag 1440 
ggacccagat . attatgtgtt cgctttgcta 1500 
taagtgaaaa ttatgggcag tgggtgatag 1560 
ttaattttta cagttttgtg gtttaaagaa 1620 
gtgtctgaac ctgagcctga gcccgagcca 1680 
cctaaaatgg cgcctgctat cctgagacgc 1740 
agtagtacgg atagctgtga ctccggtcct 1800 
gtcccgctgt gccccattaa accagttgcc 1860 
gaatcjtatcg aggacttgct taacgagcct 1920 
cccaggccat aaggtgtaaa cctgtgattg 1980 
tgagttgatg taagtttaat aaagggtgag 2040 
ggggcggggc ttaaagggta tataatgcgc 2100 
atggaggctt gggagtgttt ggaagatttt 2160 
tctaacagta cctcttggtt ttggaggttt 2220 
tgcagaatta aggaggatta caagtgggaa 2280-. 
ctgtttgatt ctttgaatct gggtcaccag 2340 
ttggattttt ccacaccggg gcgcgctgcg 2400 
gataaatgga gcgaagaaac ccatctgagcL 2460 
catctgtgga gagcggttgt gagacacaag 2520 
ccggcgataa taccgacgga ggagcagcag, 2580 
caggagcaga gcccatggaa cccgagagcc 2640 
aggtggctga actgtatcca gaactgagac 2700 
ggctaaaggg ggtaaagagg gagcgggggg 2760 
tagcttttag cttaatgacc agacaccgtc 2820 
ataattgcgc taatgagctt gatctgctgg 2880 
cttactggct gcagccaggg gatgattttg 2940 
cacttaggcc agattgcaag tacaagatca 3000 
acatttctgg gaacggggcc gaggtggaga 3060 
gtagcatgat aaatatgtgg ccgggggtgc 3120 
taaggtttac tggccccaat -tttagcggta 3180 
tacacggtgt aagcttctat gggtttaaca 3240 
gggttcgggg ctgtgccttt tactgctgct 3300 
gggcttcaat taagaaatgc ctctttgaaa 3360 
actccagggt gcgccacaat gtggcctccg 3420 
tggctgtgat taagcataac atggtatgtg 3480 
tgacctgctc ggacggcaac tgtcacctgc 3540 
gcaaggcctg gccagtgttt gagcataaca 3600 
acaggagggg ggtgttccta ccttaccaat 3660 
agcccgagag catgtccaag gtgaacctga 3720 
ggaaggtgct gaggtacgat gagacccgca 3780 
aacatattag gaaccagcct gtgatgctgg 3840 
tggtgctggc ctgcacccgc gctgagtttg 3900 
ctgaaatgtg tgggcgtggc ttaagggtgg 3960 
agttttgtat ctgttttgca gcagccgccg 4020 
gcattgtgag ctcatatttg acaacgcgca 4080 
tgatgggctc cagcattgat ggtcgccccg 4140 
acgagaccgt gtctggaacg ccgttggaga 4200 
cagccaccgc ccgcgggatt gtgactgact 4260 
cagcttcccg ttcatccgcc cgcgatgaca 4320 
ctttgacccg ggaacttaat gtcgtttctc 4380 
ctgccctgaa ggcttcctcc cctcccaatg 4440 
ctgtttggat ttggatcaag caagtgtctt 4500 
ccgaattgga tccaattcgg atcgatctta 4560 
aatggttaca aataaagcaa tagcatcaca 4620 
cattctagtt gtggtttgtc caaactcatc 4680 
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aatgtatctt atcatgtctg gtcgactcta gactcttccg cttcctcgct cactgactcg 4740 
ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg 4800 
ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag 4860 
gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac 4920 
gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga 4980 
taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt 504*0 
accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc 5100 
tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc 5160 
cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta 5220 
agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat 5280 
gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca 5340 
gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct 5400 
tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt 5460 
acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct 5520 
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 5580 
-acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 5640 
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 5700 
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 5760 
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 5820 
ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc . tgcaacttta 5880 
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 5940 
aatagtttgc gcaacgttgt tgccattgct;- acaggcatcg tggtgtcacg ctcgtcgttt 6000 
ggtatggctt cattcagctc cggttcccaavcgatcaaggc gagttacatg atcc'cccatg 6060 
ttgtgcaaaa aagcggttag. ctccttcggt cctccgatcg ttgtcagaag taagttggcc 6120 
gcagtgttat cactcatggt. tatggcagca ctgcataatt ctcttactgt catgccatcc 6180 
gtaagatgct tttctgtgac;, tggtgagtac tcaaccaagt cattctgaga atagtgtatg 6240 
cggcgaccga gttgctcttg; cccggcgtca atacgggata ataccgcgcc acatagcaga 6300 
actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 6360 
ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 6420 
tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 6480 
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 6540 
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 6600 
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc 6660 
attattatca tgacattaac ctataaaaat aggcgtatca cgaggcccct ttcgtctcgc 6720 
gcgtttcggt gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc 6780 
ttgtctgtaa gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg 6840 
cgggtgtcgg ggctggctta actatgcggc atcagagcag attgtactga gagtgcacca 6900 
tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa taccgcatca ggaaattgta 6960 
agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc attttttaac 7020 
caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga gatagggttg 7080 
agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc caacgtcaaa 7140 
gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc ctaatcaagt 7200 
tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag cccccgattt 7260 
agagcttgac ggggaaagcc ggcgaacgtg . gcgagaaagg aagggaagaa agcgaaagga . 7320 
gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac cacacccgcc 7380 
gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg caactgttgg 7440 
gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct 7500 
gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg taaaacgacg 7560 
gccagtgaat tgtaatacga ctcactatag ggcgaattaa ttcgggg 7607 



<210> 35 
<211> 11600 

<212> DNA ' 
<213> Artificial Sequence 

<220> 

<223> Description of ' Artificial Sequence: plasmid MMTV-E2a-SV40-Neo 
<400> 35 

gaattccgca ttgcagagat attgtattta agtgcctagc tcgatacaat aaacgccatt 60 
tgaccattca ccacattggt gtgcacctcc aagcttgggc agaaatggtt gaactcccga 120 
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gagtgtcct-a cacctagggg agaagcagcc 
tctgcgcaca aacggatgag cccatcagac 
ggcatagctc tgctttgcct ggggctattg 
ctcacccttg actcttttaa tagctcttct 
aactcgacct tcctcctgag gcaaggacca 
ttttgtcctt cagaaataga aataagaatg 
accaatccaa taggtagatt attagttact 
actattttta ctcaaattca gaagttagaa 
acctcaattg aagaacaggt gcaaggacta 
aaaaagagtg tttttgtcaa aataggagac 
ccttacatct acagaccaac agatgccccc 
tgggataggt gggttacagt caatggctat 
gaaagactcg ccagagctag acctccttgg 
atgaaacaac aggtacatga ttatatttat 
attttccata ccaaggaggg gacagtggct 
catggcatga gttattatga atagccttta 
agtaagtttt tggttacaaa ctgttcttaa 
cttggtttgg tatcaaaggt tctgatctga 
ttggaattta tccaaatctt atgtaaatgc 
attttttgag taaacttgca acagtcctaa 
gccatcccgt ctccgctcgt cacttatcct 
cggcgaccct caggtcggcc gactgcggca 
gtgacccttg tctctatttc tactatttgg 
ggctatcatc acaagagcgg aacggactca 
cgtccaagac cctcaaagat ttttggcact 
gcgccctgcc gcaaggccag ctgcttgtcc 
ggtatcttgc agttttggaa aaagatgtga 
acggggtaga agttgaggcg cgggttgggc 
ggtacgcgcg gtgagaatag gtggcgttcg 
ggcacatcgc tgcgctcttg caacgcgtcg 
ttcaacagca cgtcgtctcc cacatctagg 
acttgttcct cgtttgcctc tgcgttgtcc 
gagcggtcct cgtcgtcttc gcttacaaaa 
tcctcaagcg ggggtgcctc gacggggaag 
gcggtggtgg cgaactcaga gggggcggtt 
atctttttct gcctatagga gaaggaaatg 
acccccgagc gcggacgcgg tgcggcgcga 
ccgtccccgt cgccgccgcc tccccgggcg 
gagtccgagg acgaggaaga ctcatcacaa 
cggccatcga cctcggcggc ggatttggcc 
tctcccaagc ccgagcgccc gccatcacca 
gaagatgtgg cgctacaaat ggtgggtttc 
aaaggaggta agcgcacagt gcggcggctg 
cggacgcaag aggaagagga agagcccagc 
ccgctgagtg tgccgatcgt gtctgcgtgg 
atggacaagt accacgtgga taacgatcta 
gtggaagctc tggcggccgt atgcaagacc 
ctgaccttca ccagcaacaa gacctttgtg 
ctgcagtcgt ttgcagaggt gacctacaag 
ctgcaccgct gcgctgagat cgaaggcgag 
aataaggagc acgtgattga aatggatgtg 
gagcagtcta gcaaggccaa gatcgtgaag 
tccaacaccg acgcaaggtg ctgcgtgcac 
ggcaagtctt gcggcatgtt cttctctgaa 
atcaaggctt ttatgcaggc gctgtatcct 
atgccactac ggtgcgagtg caactcaaag 
ctaccaaagt tgactccgtt cgccctgagc 
tccgacaaga gcgtgctggc cagcgtgcac 
aaccctgtgt atcgcaactc gcgcgcgcag 
tcggcgcccg acctgctaaa cgcgttggtg 
accgagctgc cgcggatggt tgtgcctgag 
aacgtgtccc tgccagtggc gcatagcgat 



aaggggttgt ttcccaccaa ggacgacccg 180 
aaagacatat tcattctctg ctgcaaactt 240 
ggggaagttg cggttcgtgc tcgcagggct 300 
gtgcaagatt acaatctaaa caattcggag 360 
cagccaactt cctcttacaa gccgcatcga 420 
cttgctaaaa attatatttt taccaataag 480 
atgttaagaa atgaatcatt atcttttagt 540 
atgggaatag aaaatagaaa gagacgctca 600 
ttgaccacag gcctagaagt aaaaaaggga 660 
aggtggtggc aaccagggac ttatagggga 720 
ttaccatata caggaagata tgacttaaat 7 80 
aaagtgttat atagatccct cccttttcgt 840 
tgtatgttgt ctcaagaaga aaaagacgac 900 
ctaggaacag gaatgcactt ttggggaaag 960 
ggactaatag aacattattc tgcaaaaact 1020 
ttggcccaac cttgcggttc ccagggctta 1080 
aacgaggatg tgagacaagt ggtttcctga 1140 
gctctgagtg ttctattttc ctatgttctt 1200 
ttatgtaaac caagatataa aagagtgctg 1260 
cattcacctc ttgtgtgttt gtgtctgttc 1320 
tcactttcca gagggtcccc ccgcagaccc 1380 
gctggcgccc gaacagggac cctcggataa 1440 
tgtttgtctt gtattgtctc tttcttgtct 1500 
ccatagggac caagctagcg cttctcgtcg 1560 
tcgttgagcg aggcgatatc v aggtatgaca. 1620 
gctcggctgc ggttggcacg gcaggatagg 1680 
taggtggcaa gcacctctgg-eacggcaaat : 1740 
tcgcatgtgc cgttttcttg gcgtttgggg 1800 
taggcaaggc tgacatccgc tatggcgagg 1860 
cagataatgg cgcactggcg ctgcagatgc 1920 
tagtcgccat gcctttcgtc cccccgcccg 1980 
tggtcttgct ttttatcctc tgttggtact 2040 
cctgggtcct gctcgataat cacttcctcc 2100 
gtggtaggcg cgttggcggc atcggtggag 2160 
aggctgtcct tcttctcgac tgactccatg 2220 
gccagtcggg aagaggagca gcgcgaaacc 2280 
cgtcccccaa ccatggagga cgtgtcgtcc 2340 
cccccaaaaa agcggatgag gcggcgtatc 2400 
gacgcgctgg tgccgcgcac acccagcccg 2460 
attgcgccca agaagaaaaa gaagcgccct 2520 
gaggtaatcg tggacagcga ggaagaaaga 2580 
agcaacccac cggtgctaat caagcatggc 2640 
aatgaagacg acccagtggc gcgtggtatg 2700 
gaagcgg.aaa gtgaaattac ggtgatgaac 2760 
gagaagggca tggaggctgc gcgcgcgctg 2820 
aaggcgaact tcaaactact gcctgaccaa 2880 
tggctgaacg aggagcaccg cgggttgcag 2940 
acgatgatgg ggcgattcct gcaggcgtac 3000 
catcacgagc ccacgggctg cgcgttgtgg 3060 
cttaagtgtc tacacggaag cattatgata 3120 
acgagcgaaa acgggcagcg cgcgctgaag 3180 
aaccggtggg gccgaaatgt ggtgcagatc 3240 
gacgcggcct gtccggccaa tcagttttcc 3300 
ggcgcaaagg ctcaggtggc ttttaagcag 3360 
aacgcccaga ccgggcacgg tcaccttttg 3420 
cctgggcacg cgcccttttt gggaaggcag 3480 
aacgcggagg acctggacgc ggatctgatc 3540 
cacccggcgc tgatagtgtt ccagtgctgc 3600 
ggcggaggcc ccaactgcga cttcaagata 3660 
atggtgcgca gcctgtggag tgaaaacttc 3720 
tttaagtgga gcactaaaca ccagtatcgc 3780 
gcgcggcaga acccctttga tttttaaacg 3840 
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gcgcagacgg caagggtggg ggtaaataat 
cctttattga aagtgtctct agtacattat 
gtggcgctcc taatctgcgc actgtggctg 
ctgtagagct gttcctggtt gcgacgcagg 
gagttgggta ccccggtaat aaggttcatg 
cagttggcaa aggcgtggag aaacatgcag 
ccctgtacgc tttgggtgga cttttccagc 
gcgctacggc gcaggagtga ctcgtactca 
gaaaagccaa agggctcaaa gaggtagcat 
atccagtgta cgcccccagt ctcgcgaccg 
tgtggagaaa caaagcctgg aaagcgcttg 
ccaagatctt tgacaatggc tttcagttcc 
gttgatgttg cttgcttctt tatgttgtgg 
acacggtttc gatgacgccg cggtgcggcc 
acaaaacata aagaagggtg ggctcgtcca 
ttacctcagg tcgacctcga gggatctttg 
aattggacaa actacctaca gagatttaaa 
tataatgtgt taaactactg attctaattg 
ctgatgaatg ggagcagtgg tggaatgcct 
aaatgccatc tagtgatgat gaggctactg 
agaagagaaa ggtagaagac cccaaggact 
atgctgtgtt tagtaataga actcttgctt 
ctgcactgct atacaagaaa[attatggaaa 
acagt tataa : ,t cataacat a - ctgt 1 1 1 1 1 c 
ttaataacta, tgctcaaaaa ttgtgtacct 
aggaatattitv.gatgtatagt gccttgacta 
gaggttttac .ttgctttaaa aaacctccca 
aatgcaattg ttgttgttaa cttgtttatt 
agcatcacaa atttcacaaa taaagcattt 
aaactcatca atgtatctta tcatgtctgg 
tgtggaaagt ccccaggctc cccagcaggc 
tcagcaacca ggtgtggaaa gtccccaggc 
catctcaatt agtcagcaac catagtcccg 
ccgcccagtt ccgcccattc tccgccccat 
gccgaggccg cctcggcctc tgagctattc 
ctaggctttt gcaaaaagct tcacgctgcc 
ggaagcggaa cacgtagaaa gccagtccgc 
gctactgggc tatctggaca agggaaaacg 
gtgggcttac atggcgatag ctagactggg 
tgccagctgg ggcgccctct ggtaaggttg 
tcttgccgcc aaggatctga tggcgcaggg 
gatcgtttcg catgattgaa caagatggat 
agaggctatt cggctatgac tgggcacaac 
tccggctgtc agcgcagggg cgcccggttc 
tgaatgaact gcaggacgag gcagcgcggc 
gcgcagctgt gctcgacgtt gtcactgaag 
tgccggggca ggatctcctg tcatctcacc 
ctgatgcaat gcggcggctg catacgcttg 
cgaaacatcg catcgagcga gcacgtactc 
atctggacga agagcatcag gggctcgcgc 
gcatgcccga cggcgaggat ctcgtcgtga 
tggtggaaaa tggccgcttt tctggattca 
gctatcagga catagcgttg gctacccgtg 
ctgaccgctt cctcgtgctt tacggtatcg 
atcgccttct tgacgagttc ttctgagcgg 
gacgcccaac ctgccatcac gagatttcga 
cttcggaatc gttttccggg acgccggctg 
ggagttcttc gcccaccccg ggctcgatcc 
gctggacgac ctcgcggagt tctaccggca 
cagcggctat ccgcgcatcc atgcccccga 
tttggtcccg gatctttgtg aaggaacctt 
tacctacaga gatttaaagc tctaaggtaa 



cacccgagag tgtacaaata aaagcatttg 3900 
ttttacatgt ttttcaagtg acaaaaagaa 3960 
cggaagtagg gcgagtggcg ctccaggaag 4020 
gtgggctgta cctggggact gttgagcatg 4080 
gtggggttgt gatccatggg agtttggggc 4140 
cagaatagtc cacaggcggc cgagttgggc 4200 
gttatacagc ggtcggggga agaagcaatg 4260 
aactggtaaa cctgcttgag tcgctggtca 4320 
gtttttgagt gcgggttcca ggcaaaggcc 4380 
gccgtattga ctatggcgca ggcgagcttg 4440 
tcataggtgc ccaaaaaata tggcccacaa 4500 
tgctcactgg agcccatggc ggcagctgtt 4560 
cgttgccggc cgagaagggc gtgcgcaggt 4620 
ggtgcacacg gaccacgtca aagacttcaa 4680 
tgggatccat atatagggcc cgggttataa 4740 
tgaaggaacc ttacttctgt ggtgtgacat 4800 
gctctaaggt aaatataaaa tttttaagtg 4860 
tttgtgtatt ttagattcca acctatggaa 4920 
ttaatgagga aaacctgttt tgctcagaag 4980 
ctgactctca acattctact cctccaaaaa 5040 
ttccttcaga attgctaagt tttttgagtc 5100 
gctttgctat ttacaccaca aaggaaaaag 5160 
aatattctgt aacctttata agtaggcata 5220 
ttactccaca caggcataga gtgtctgcta 5280 
ttagcttttt aatttgtaaa ggggttaata 5340 • , r . 
gagatcataa tcagccatac cacatttgta 5400 
cacctccccc tgaacctgaa acataaaatg 5460 
gcagcttata atggttacaa ataaagcaat 5520 
ttttcactgc attctagttg tggtttgtcc 5580 
atccggctgt ggaatgtgtg tcagttaggg 5640 
agaagtatgc aaagcatgca tctcaattag 5700 
tccccagcag gcagaagtat gcaaagcatg 5760 
cccctaactc cgcccatccc gcccctaact 5820 
ggctgactaa ttttttttat ttatgcagag 5880 
cagaagtagt gaggaggctt ttttggaggc 5940 
gcaagcactc agggcgcaag ggctgctaaa 6000 
agaaacggtg ctgaccccgg atgaatgtca 6060 
caagcgcaaa gagaaagcag gtagcttgca 6120 
cggttttatg gacagcaagc gaaccggaat 6180 
ggaagccctg caaagtaaac tggatggctt 6240 
gatcaagatc tgatcaagag acaggatgag 6300 
tgcacgcagg ttctccggcc gcttgggtgg 6360 
agacaatcgg ctgctctgat gccgccgtgt 6420 
tttttgtcaa gaccgacctg tccggtgccc 6480 
tatcgtggct ggccacgacg ggcgttcctt 6540 
cgggaaggga ctggctgcta ttgggcgaag 6600 
ttgctcctgc cgagaaagta tccatcatgg 6660 
atccggctac ctgcccattc gaccaccaag 6720 
ggatggaagc cggtcttgtc gatcaggatg 6780 
cagccgaact gttcgccagg ctcaaggcgc 6840 
cccatggcga tgcctgcttg ccgaatatca 6900 
tcgactgtgg ccggctgggt gtggcggacc 6960 
atattgctga agagcttggc ggcgaatggg 7020 
ccgctcccga ttcgcagcgc atcgccttct 7080 
gactctgggg ttcgaaatga ccgaccaagc 7140 
ttccaccgcc gccttctatg aaaggttggg 7200 
gatgatcctc cagcgcgggg atctcatgct 7260 
cctcgcgagt tggttcagct gctgcctgag 7320 
gtgcaaatcc gtcggcatcc aggaaaccag 7380 
actgcaggag tggggaggca cgatggccgc 7440 
acttctgtgg tgtgacataa ttggacaaac 7500 
atataaaatt tttaagtgta taatgtgtta 7560 
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aactactgat tctaattgtt tgtgtatttt 
agcagtggtg gaatgccttt aatgaggaaa 
gtgatgatga ggctactgct gactctcaac 
tagaagaccc caaggacttt ccttcagaat 
gtaatagaac tcttgcttgc tttgctattt 
acaagaaaat tatggaaaaa tattctgtaa 
ataacatact gttttttctt actccacaca 
ctcaaaaatt gtgtaccttt agctttttaa 
tgtatagtgc cttgactaga gatcataatc 
gctttaaaaa acctcccaca cctccccctg 
gttgttaact tgtttattgc agcttataat 
ttcacaaata aagcattttt ttcactgcat 
gtatcttatc atgtctggat ccccaggaag 
ctctacttga gaggacattc caatcatagg 
aattaggtca cttaacaaaa aggaaattgg 
ggtaatttta aaatatctgg gaagtccctt 
cagcccacaa atgtcaacag cagaaacata 
accctgctca tcaagaagca ctgtggttgc 
attttcccca cctgtgtagg ttccaaaata 
aacccagcac tccactggat aagcattatc 
catctgctga ctgtcaactg tagcattttt 
cctgtagttt gctaacacac cctgcagctc 
aaaatttgac ccttgaatgg gttttccagc 
atgcaagttt aacatagcag ttaccccaat 
catcaaaata tttccacagg ttaagtcctc 
cgaaagggcc tcgtgatacg cctattttta 
tagacgtcag gtggcacttt tcggggaaat 
taaatacatt caaatatgta tccgctcatg 
tattgaaaaa ggaagagtat gagtattcaa 
gcggcatttt gccttcctgt ttttgctcac 
gaagatcagt tgggtgcacg agtgggttac 
cttgagagtt ttcgccccga agaacgtttt 
tgtggcgcgg tattatcccg tgttgacgcc 
tattctcaga atgacttggt tgagtactca 
atgacagtaa gagaattatg cagtgctgcc 
ttacttctga caacgatcgg aggaccgaag 
gatcatgtaa ctcgccttga tcgttgggaa 
gagcgtgaca ccacgatgcc tgcagcaatg 
gaactactta ctctagcttc ccggcaacaa 
gcaggaccac ttctgcgctc ggcccttccg 
gccggtgagc gtgggtctcg cggtatcatt 
cgtatcgtag ttatctacac gacggggagt 
atcgctgaga taggtgcctc actgattaag 
tatatacttt agattgattt aaaacttcat . 
ctttttgata atctcatgac caaaatccct 
gaccccgtag aaaagatcaa aggatcttct 
tgcttgcaaa caaaaaaacc accgctacca 
ccaactcttt ttccgaaggt aactggcttc 
ctagtgtagc cgtagttagg ccaccacttc 
gctctgctaa tcctgttacc agtggctgct 
ttggactcaa gacgatagtt accggataag 
tgcacacagc ccagcttgga gcgaacgacc 
ctatgagaaa gcgccacgct tcccgaaggg 
agggtcggaa caggagagcg cacgagggag 
agtcctgtcg ggtttcgcca cctctgactt 
99gcggagcc tatggaaaaa cgccagcaac 
tggccttttg ctcacatgtt ctttcctgcg 
accgcctttg agtgagctga taccgctcgc 
gtgagcgagg aagcggaaga gcgcctgatg 
atttcacacc gcatatggtg cactctcagt 
cagtatctgc tccctgcttg tgtgttggag 
gctacaacaa ggcaaggctt gaccgacaat 



agattccaac ctatggaact gatgaatggg 7620 
acctgttttg ctcagaagaa atgccatcta 7680 
attctactcc tcca!aaaaag aagagaaagg 7740 
tgctaagttt tttgagtcat gctgtgttta 7800 
acaccacaaa ggaaaaagct gcactgctat 7860 
cctttataag taggcataac agttataatc 7920 
ggcatagagt gtctgctatt aataactatg 7980 
tttgtaaagg ggttaataag gaatatttga 8040 
agccatacca catttgtaga ggttttactt 8100 
aacctgaaac ataaaatgaa tgcaattgtt 8160 
ggttacaaat aaagcaatag catcacaaat 8220 
tctagttgtg gtttgtccaa actcatcaat 8280 
ctcctctgtg tcctcataaa ccctaacctc 8340 
ctgcccatcc accctctgtg tcctcctgtt 8400 
gtaggggttt ttcacagacc gctttctaag 84 60 
ccactgctgt gttccagaag tgttggtaaa 8520 
caagctgtca gctttgcaca agggcccaac 8580 
tgtgttagta atgtgcaaaa caggaggcac 8640 
tctagtgttt tcatttttac ttggatcagg 8700 
cttatccaaa acagccttgt ggtcagtgtt 8760 
tggggttaca gtttgagcag gatatttggt 8820 
caaaggttcc ccaccaacag caaaaaaatg 8880 
accattttca tgagtttttt gtgtccctga. 8940 
aacctcagtt,.ttaacagtaa cagcttccca 9000 
atttaaatta. ggcaaaggaa ttcttgaaga 9060 
taggttaatg tcatgataat aatggtttct 9120 
gtgcgcggaa cccctatttg tttatttttc 9180 
agacaataac cctgataaat gcttcaataa 9240 
catttccgtg tcgcccttat tccctttttt 9300 
ccagaaacgc tggtgaaagt aaaagatgct 9360 
atcgaactgg atctcaacag cggtaagatc 942 0 
ccaatgatga gcacttttaa agttctgcta 9480 
gggcaagagc aactcggtcg ccgcatacac 9540 
ccagtcacag aaaagcatct tacggatggc 9600 
ataaccatga gtgataacac tgcggccaac 9.660 
gagctaaccg cttttttgca caacatgggg 9720 
ccggagctga atgaagccat accaaacgac 9780 
gcaacaacgt tgcgcaaact attaactggc 9840 
ttaatagact ggatggaggc ggataaagtt 9900 
gctggctggt ttattgctga taaatctgga 9960 
gcagcactgg ggccagatgg taagccctcc 10020 
caggcaacta tggatgaacg aaatagacag 10080 
cattggtaac tgtcagacca agtttactca 10140 
ttttaattta aaaggatcta ggtgaagatc 10200 
taacgtgagt tttcgttcca ctgagcgtca 10260 
tgagatcctt tttttctgcg cgtaatctgc 10320 
gcggtggttt gtttgccgga tcaagagcta 10380 
agcagagcgc agataccaaa tactgtcctt 10440 
aagaactctg tagcaccgcc tacatacctc 10500 
gccagtggcg ataagtcgtg tcttaccggg 10560 
gcgcagcggt cgggctgaac ggggggttcg 10620 
tacaccgaac tgagatacct acagcgtgag 10680 
agaaaggcgg acaggtatcc ggtaagcggc 10740 
cttccagggg gaaacgcctg gtatctttat 10800 
gagcgtcgat ttttgtgatg ctcgtcaggg 10860 
gcggcctttt tacggttcct ggccttttgc 10920 
ttatcccctg attctgtgga taaccgtatt 10980 
cgcagccgaa cgaccgagcg cagcgagtca 11040 
cggtattttc tccttacgca tctgtgcggt 11100 
acaatctgct ctgatgccgc atagttaagc 11160 
gtcgctgagt agtgcgcgag caaaatttaa 11220 
tgcatgaaga atctgcttag ggttaggcgt 11280 
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tttgcgctgc ttcgcgatgt acgggccaga tatacgcgta tctgagggga ctagggtgtg 11340 
tttaggcgaa aagcggggct tcggttgtac gcggttagga gtcccctcag gatatagtag 11400 
tttcgctttt gcatagggag ggggaaatgt agtcttatgc aatacacttg tagtcttgca 11460 
acatggtaac gatgagttag caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc 11520 
gattggtgga agtaaggtgg tacgatcgtg ccttattagg aaggcaacag acgggtctga 11580 
catggattgg acgaaccact , 11600 

<210> 36 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

gtcactcgag gactcggtcg actgaaaatg agacatatta tctgccacgg acc 53 

<210> 37 
<211> 36 
<212> DNA • 

<213> Artificial Sequence - v 

<22Q> > : '■ •■ * > . :;i-/i*J-:-^--, 

$<223> Description of Artificial Sequence: primer I- 

<400> 37 

cgagatcgat cacctccggt acaaggtttg gcatag 36 

<210> 38 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

catgaagatc tggaaggtgc tgaggtacga tgagacc 37 

<210> 39 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 39 

gcgacttaag cagtcagctg agacagcaag acacttgctt gatccaaatc c 51 

<210> 40 
<211> 38 
<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 40 

cacgaattcg tcagcgcttc tcgtcgcgtc caagaccc 
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<210> 41 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

caccccgggg aggcggcggc gacggggacg gg 32 

<210> 42 
<211> 7231 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid pDV80 
<400> 42 

ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg cgagcaaaat ttaagctaca 60 
acaaggcaag gcttgaccga caattgcatg aagaatctgc ttagggttag gcgttttgcg 120 
ctgcttcgcg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 180 
tagtaatcaa. ttacggggtc attagttcat.agcccatata tggagttccg cgttacataa 240 
cttacggtaa atggcccgcc tggctgac.cg: eccaacgacc cccgcccatt gacgtcaata 300 
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 360 
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 420 
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 480 
tgggactttc ctacttggca gtacatctac gtattagt-ca tcgctattac catggtgatg 540 
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 600 
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 660 
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 720 
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 780 
attaatacga ctcactatag ggagacccaa gctggctagc gtttaaactt aagcttggta 840 
ccgagctcgg atccactctc ttccgcatcg ctgtctgcga gggccagctg ttggggtgag 900 
tactccctct gaaaagcggg catgacttct gcgctaagat tgtcagtttc caaaaacgag 960 
gaggatttga tattcacctg gcccgcggtg atgcctttga gggtggccgc atccatctgg 1020 
tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa acgacccgta gagggcgttg 1080 
gacagcaact tggcgatgga gcgcagggtt tggtttttgt cgcgatcggc gcgctccttg 1140 
gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc gccattcggg aaagacggtg 1200 
gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt tgtgcagggt gacaaggtca 1260 
acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc agcagaggcg gccgcccttg 1320 
cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt ccggggggtc tgcgtccacg 1380 
gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta tcttgcatcc ttgcaagtct .1440 
agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt atgggttgag tgggggaccc 1500 
catggcatgg ggtgggtgag cgcggaggcg tacatgccgc aaatgtcgta aacgtagagg 1560 
ggctctctga gtattccaag atatgtaggg tagcatcttc caccgcggat gctggcgcgc 1620 
acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg gaccgaggtt gctacgggcg 1680 
ggctgctctg ctcggaagac tatctgcctg aagatggcat gtgagttgga tgatatggtt 1740 
ggacgctgga agacgttgaa gctggcgtct gtgagaccta ccgcgtcacg cacgaaggag 1800 
gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga cctgcacgtc tagggcgcag 1860 
tagtccaggg tttccttgat gatgtcatac ttatcctgtc cctttttttt ccacagctcg 1920 
cggttgagga caaactcttc gcggtctttc cagtactctt ggatcggaaa cccgtcggcc 1980 
tccgaacgag atccgtactc cgccgccgag ggacctgagc gagtccgcat cgaccggatc 2040 
ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg caagatccaa gatgaagcgc 2100 
gcccgcccca gcgaagatga cttcaacccc gtctacccct atggctacgc gcggaatcag 2160 
aatatcccct tcctcactcc cccctttgtc tcctccgatg gattcaaaaa cttcccccct 2220 
ggggtactgt cactcaaact ggctgatcca atcaccatta ccaatgggga tgtatccctc 2280 
aaggtgggag gtggtctcac tttgcaagat ggaagcctaa ctgtaaaccc taaggctcca 2340 
ctgcaagtta atactgataa aaaacttgag cttgcatatg ataatccatt tgaaagtagt 2400 
gctaataaac ttagtttaaa agtaggacat ggattaaaag tattagatga aaaaagtgct 2460 
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gcggggttaa aagatttaat tggcaaactt 
gaaaatttag aaaatacaga tggtagcagc 
gaagggttga catttgacaa tgatggatac 
cgcacacttt ggacaacacc agacacatct 
tctaaactca ctttggtact tacaaagtgt 
attgtggtcg caggaaagta ccacatcata 
tttactatta aactgctatt taataagaac 
aaagcttatt ggaactttag aagtggaaat 
attggtttta tgcctaattt ggtagcgtat 
agagacatag tttatggaac tatatatctt 
aaaactacct ttaaccaaga aactggatgt 
tccaaaacct atgaaaatgt tgaatttgaa 
caagaatgaa agagcggccg ctcgagtcta 
cgactgtgcc ttctagttgc cagccatctg 
ccctggaagg tgccactccc actgtccttt 
gtctgagtag gtgtcattct attctggggg 
attgggaaga caatagcagg catgctgggg 
aaagaaccag ctggggctct agggggtatc 
cggcgggtgt ggtggttacg cgcagcgtga 
ctcctttcgc tttcttccct tcctttctcg 
taaatcgggg catcccttta gggttccgat 
aacttgatta gggtgatggt tcacgtagtg 
ctt'tgacgtt ggagtccacg ttctttaata 
/tcaaccctat ctcggtctat tcttttgatt 
ggttaaaaaa tgagctgatt taacaaaaat 
tcagttaggg tgtggaaagt ccccaggctc 
atctcaatta gtcagcaacc aggtgtggaa 
tgcaaagcat gcatctcaat tagtcagcaa 
cgcccctaac tccgcccagt tccgcccatt 
tttatgcaga ggccgaggcc gcctctgcct 
tttttggagg cctaggcttt tgcaaaaagc 
ctgatcagca cgtgttgaca attaatcatc 
caaggtgagg a-actaaacca tggccaagtt 
cgacgtcgcc ggagcggtcg agttctggac 
ggaggacgac ttcgccggtg tggtccggga 
ggaccaggtg gtgccggaca acaccctggc 
gtacgccgag tggtcggagg tcgtgtccac 
gaccgagatc ggcgagcagc cgtgggggcg 
ctgcgtgcac ttcgtggccg aggagcagga 
cgccgccttc tatgaaaggt tgggcfctcgg 
cctccagcgc ggggatctca tgctggagtt 
ttataatggt tacaaataaa gcaatagcat 
actgcattct agttgtggtt tgtccaaact 
gtcgacctct agctagagct tggcgtaatc 
ttatccgctc acaattccac acaacatacg 
tgcctaatga gtgagctaac tcacattaat 
gggaaacctg tcgtgccagc tgcattaatg 
gcgtattggg cgctcttccg cttcctcgct 
gcggcgagcg gtatcagctc actcaaaggc 
taacgcagga aagaacatgt gagcaaaagg 
cgcgttgctg gcgtttttcc ataggctccg 
ctcaagtcag aggtggcgaa acccgacagg 
aagctccctc gtgcgctctc ctgttccgac 
tctcccttcg ggaagcgtgg cgctttctca 
gtaggtcgtt cgctccaagc tgggctgtgt 
cgccttatcc ggtaactatc gtcttgagtc 
ggcagcagcc actggtaaca ggattagcag 
cttgaagtgg tggcctaact acggctacac 
gctgaagcca gttaccttcg gaaaaagagt 
cgctggtagc ggtggttttt ttgtttgcaa 
tcaagaagat cctttgatct tttctacggg 
ttaagggatt ttggtcatga gattatcaaa 



gtggttttaa caggaaaagg aataggcact 2520 
agaggaattg gtataaatgt aagagcaaga 2580 
ttggtagcat ggaacccaaa gtatgacacg 2640 
ccaaactgca caattgctca agataaggac 2700 
ggaagtcaaa tattagctaa tgtgtctttg 2760 
aataataaga caaatccaaa aataaaaagt 2820 
ggagtgcttt tagacaactc aaatcttgga 2880 
tccaatgttt cgacagctta tgaaaaagca 2940 
ccaaaaccca gtaattctaa aaaatatgca 3000 
ggtggaaaac ctgatcagcc agcagtcatt 3060 
gaatactcta tcacatttaa ctttagttgg 3120 
accacctctt ttaccttctc ctatattgcc 3180 
gagggcccgt ttaaacccgc tgatcagcct 3240 
ttgtttgccc ctcccccgtg ccttccttga 3300 
cctaataaaa tgaggaaatt gcatcgcatt 3360 
gtggggtggg gcaggacagc aagggggagg 3420 
atgcggtggg ctctatggct tctgaggcgg 3480 
cccacgcgcc ctgtagcggc gcattaagcg 3540 
ccgctacact tgccagcgcc ctagcgcccg 3 600 
ccacgttcgc cggctttccc cgtcaagctc 3660 
ttagtgcttt acggcacctc gaccccaaaa 3720 
ggccatcgcc ctgatagacg gtttttcgcc .3780 
gtggactctt gttccaaact ggaacaacac ; . ; 3.|40 
tataagggat tttggggatt tcggcctatt 3900 
ttaacgcgaa ttaattctgt grgaatgtgtg 3960 
cccaggcagg cagaagt atg v caa^gcatgc 4020 
agtccccagg ctccccagca ggcagaagta 4080 
ccatagtccc gcccctaact ccgcccatcc 4140 
ctccgcccca tggctgacta atttttttta 4200 
ctgagctatt ccagaagtag tgaggaggct 4260 
tcccgggagc ttgtatatcc attttcggat 4320 
ggcatagtat atcggcatag tataatacga 4380 
gaccagtgcc gttccggtgc tcaccgcgcg 4440, 
cgaccggctc gggttctccc gggacttcgt 4500 
cgacgtgacc ctgttcatca gcgcggtcca 4560 
ctgggtgtgg gtgcgcggcc tggacgagct 4620 
gaacttccgg gacgcctccg ggccggccat 4680 
ggagttcgcc ctgcgcgacc cggccggcaa 4740 
ctgacacgtg ctacgagatt tcgatt.ccac 4800 
aatcgttttc cgggacgccg gctggatgat 4860 
cttcgcccac cccaacttgt ttattgcagc 4920 
cacaaatttc acaaataaag catttttttc 4980 
catcaatgta tcttatcatg tctgtatacc 5040 
atggtcatag ctgtttcctg tgtgaaattg 5100 
agccggaagc ataaagtgta aagcctgggg 5160 
tgcgttgcgc tcactgcccg ctttccagtc 5220 
aatcggccaa cgcgcgggga gaggcggttt 5280 
cactgactcg ctgcgctcgg tcgttcggct 5340 
ggtaatacgg ttatccacag aatcagggga 54 00 
ccagcaaaag gccaggaacc gtaaaaaggc 5460 
cccccctgac gagcatcaca aaaatcgacg 5520 
actataaaga taccaggcgt ttccccctgg 5580 
cctgccgctt accggatacc tgtccgcctt 5640 
atgctcacgc tgtaggtatc tcagttcggt 5700 
gcacgaaccc cccgttcagc ccgaccgctg 5760 
caacccggta agacacgact tatcgccact 5820 
agcgaggtat gtaggcggtg ctacagagtt 5880 
tagaaggaca gtatttggta tctgcgctct 5940 
tggtagctct tgatccggca aacaaaccac 6000 
gcagcagatt acgcgcagaa aaaaaggatc 6060 
gtctgacgct cagtggaacg aaaactcacg 6120 
aaggatcttc acctagatcc ttttaaatta 6180 
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aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 6240 
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 6300 
ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 6360 
tgcaatgataccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc 6420 
agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat 6480 
taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt 6540 
tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc 6600 
cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag 6660 
ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt 6720 
tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac 6780 
tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg 6840 
cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat 6900 
tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc 6960 
gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc 7020 
tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa 7080 
atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg 7140 
tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg 7200 
cacatttccc cgaaaagtgc cacctgacgt c 7231 

<210> 43 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> ;;.^-; v v,.'v 

<223> Description;, of- Artificial Sequence: primer 
<400> 43 

tgtcttggat ccaagatgaa gcgcgcccgc cccagcgaag atgacttc 48 



<210> 44 
<211> 28 
<212> DNA 

<213> Artificial. Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 44 

aaacacggcg gccgctcttt cattcttg ' 28 

<210> 45 
<211> 17 
<212> PRT 

<213> Ad 37 N-terminus 
<400> 45 

Met Ser Lys Arg- Leu Arb Val Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 
1 5 10 15 

<210> 46 
<211> 6 
<212> PRT 

<213> artificial sequence 
<400> 46 

Lys Arg Ala Arg Pro Ser 
15 



v <210> 47 
<211> 17 
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<212> PRT 

<213> Ad5 modified N- terminus 
<400> 47 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro Tyr 
15 10 15 



<210> 48 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 48 ' ' 

ggatccatgg gatacttggt agca 24 

<210> 49 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> ( 

<223> Description of Artificial Sequence: pfimerV : \ ■ 1 



<400> 49 

gcaactcgag tcattcttgg gcaatatagg 

<210> 50 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 50 

cgcgctgact cttaaggact agtttc 25 

<210> 51 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 51 

gcgcttaatt aacatcatca ataatatacc ttatttt 
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